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CGGCAGCCCTGACGTGATGAGCTCAACCAGCAGAGACATTCCATCCCAAGAGAGGTCTGCGTGACGCGTCCGGGAGG 
CCACCCTCAGCAAGACCACCGTACAGTTGGTGGAAGGGGTGACAGCTGCATTCTCCTGTGCCTACCACGTAACCAAA 
AATGAAGGAGAACTACTGTTTACAAGCCGCCCTGGTGTGCCTGGGCATGCTGTGCCACAGCCATGCCTTTGCCCCAG 
AGCGGCGGGGGCACCTGCGGCCCTCCTTCCATGGGCACCATGAGAAGGGCAAGGAGGGGCAGGTGCTACAGCGCTCC 
AAGCGTGGCTGGGTCTGGAACCAGTTCTTCGTGATAGAGGAGTACACCGGGCCTGACCCCGTGCTTGTGGGCAGGCT 
TCATTCAGATATTGACTCTGGTGATGGGAACATTAAATACATTCTCTCAGGGGAAGGAGCTGGAACCATTTTTGTGA 
TTGATGACAAATCAGGGAACATTCATGCCACCAAGACGTTGGATCGAGAAGAGAGAGCCCAGTACACGTTGATGGCT 
CAGGCGGTGGACAGGGACACCAATCGGCCACTGGAGCCACCGTCGGAATTCATTGTCAAGGTCCAGGACATTAATGA 
CAACCCTCCGGAGTTCCTGCACGAGACCTATCATGCCAACGTGCCTGAGAGGTCCAATGTGGGAACGTCAGTAATCC 
AGGTGACAGCTTCAGATGCAGATGACCCCACTTATGGAAATAGCGCCAAGTTAGTGTACAGTATCCTCGAAGGACAA 
CCCTATTTTTCGGTGGAAGCACAGACAGGTATCATCAGAACAGCCCTACCCAACATGGACAGGGAGGCCAAGGAGGA 
GTACCACGTGGTGATCCAGGCCAAGGACATGGGTGGACATATGGGCGGACTCTCAGGGACAACCAAAGTGACGATCA 
CACTGACCGATGTCAATGACAACCCACCAAAGTTTCCGCAGAGGCTATACCAGATGTCTGTGTCAGAAGCAGCCGTC 
CCTGGGGAGGAAGTAGGAAGAGTGAAAGCTAAAGATCCAGACATTGGAGAAAATGGCTTAGTCACATACAATATTGT 
TGATGGAGATGGTATGGAATCGTTTGAAATCACAACGGACTATGAAACACAGGAGGGGGTGATAAAGCTGAAAAAGC 
CTGTAGATTTTGAAACCGAAAGAGCCTATAGCTTGAAGGTAGAGGCAGCCAACGTGCACATCGACCCGAAGTTTATC 
AGCAATGGCCCTTTCAAGGACACTGTGACCGTCAAGATCTCAGTAGAAGATGCTGATGAGCCCCCTATGTTCTTGGC 
CCCAAGTTACATCCACGAAGTCCAAGAAAATGCAGCTGCTGGCACCGTGGTTGGGAGAGTGCATGCCAAAGACCCTG 
ATGCTGCCAACAGCCCGATAAGGTATTCCATCGATCGTCACACTGACCTCGACAGATTTTTCACTATTAATCCAGAG 
GATGGTTTTATTAAAACTACAAAACCTCTGGATAGAGAGGAAACAGCCTGGCTCAACATCACTGTCTTTGCAGCAGA 
AATCCACAATCGGCATCAGGAAGCCCAAGTCCCAGTGGCCATTAGGGTCCTTGATGTCAACGATAATGCTCCCAAGT 
TTGCTGCCCCTTATGAAGGTTTCATCTGTGAGAGTGATCAGACCAAGCCACTTTCCAACCAGCCAATTGTTACAATT 
AGTGCAGATGACAAGGATGACACGGCCAATGGACCAAGATTTATCTTCAGCCTACCCCCTGAAATCATTCACAATCC 
AAATTTCACAGTCAGAGACAACCGAGATAACACAGCAGGCGTGTACGCCCGGCGTGGAGGGTTCAGTCGGCAGAAGC 
AGGACTTGTACCTTCTGCCCATAGTGATCAGCGATGGCGGCATCCCGCCCATGAGTAGCACCAACACCCTCACCATC 
AAAGTCTGCGGGTGCGACGTGAACGGGGCACTGCTCTCCTGCAACGCAGAGGCCTACATTCTGAACGCCGGCCTGAG 
CACAGGCGCCCTGATCGCCATCCTCGCCTGCATCGTCATTCTCCTGGTCATTGTAGTATTGTTTGTGACCCTGAGAA 
GGCAAAAGAAAGAACCACTCATTGTCTTTGAGGAAGAAGATGTCCGTGAGAACATCATTACTTATGATGATGAAGGG 
GGTGGGGAAGAAGACACAGAAGCCTTTGATATTGCCACCCTCCAGAATCCTGATGGTATCAATGGATTTATCCCCCG 
CAAAGACATCAAACCTGAGTATCAGTACATGCCTAGACCTGGGCTCCGGCCAGCGCCCAACAGCGTGGATGTCGATG 
ACTTCATCAACACGAGAATACAGGAGGCAGACAATGACCCCACGGCTCCTCCTTATGACTCCATTCAAATCTACGGT 
TATGAAGGCAGGGGCTCAGTGGCCGGGTCCCTGAGCTCCCTAGAGTCGGCCACCACAGATTCAGACTTGGACTATGA 
TTATCTACAGAACTGGGGACCTCGTTTTAAGAAACTAGCAGATTTGTATGGTTCCAAAGACACTTTTGATGACGATT 
CTTAACAATAACGATACAAATTTGGCCTTAAGAACTGTGTCTGGCGTTCTCAAGAATCTAGAAGATGTGTAACAGGT 
ATTTTTT 
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AACTCAAACTCCTCTCTCTGGGAAAACGCGGTGCTTGCTCCTCCCGGAGTGGCCTTGGCAGGGTGTTGGAGCCCTCG 
GTCTGCCCCGTCCGGTCTCTGGGGCCAAGGCTGGGTTTCCCTC ATG TATGGCAAGAGCTCTACTCGTGCGGTGCTTC 
TTCTCCTTGGCATACAGCTCACAGCTCTTTGGCCTATAGCAGCTGTGGAAATTTATACCTCCCGGGTGCTGGAGGCT 
GTTAATGGGACAGATGCTCGGTTA7UVATGCACTTTCTCCAGCTTTGCCCCTGTGGGTGATGCTCTAACAGTGACCTG 
GAATTTTCGTCCTCTAGACGGGGGACCTGAGCAGTTTGTATTCTACTACCACATAGATCCCTTCCAACCCATGAGTG 
GGCGGTTTAAGGACCGGGTGTCTTGGGATGGGAATCCTGAGCGGTACGATGCCTCCATCCTTCTCTGGAAACTGCAG 
TTCGACGACAATGGGACATACACCTGCCAGGTGAAGAACCCACCTGATGTTGATGGGGTGATAGGGGAGATCCGGCT 
CAGCGTCGTGCACACTGTACGCTTCTCTGAGATCCACTTCCTGGCTCTGGCCATTGGCTCTGCCTGTGCACTGATGA 
TCATAATAGTAATTGTAGTGGTCCTCTTCCAGCATTACCGGAAAAAGCGATGGGCCGAAAGAGCTCATAAAGTGGTG 
GAGATAAAATCAAAAGAAGAGGAAAGGCTCAACCAAGAGAAAAAGGTCTCTGTTTATTTAGAAGACACAGAC TAA CA 
ATTTTAGATGGAAGCTGAGATGATTTCCAAGAACAAGAACCCTAGTATTTCTTGAAGTTi\ATGGAAACTTTTCTTTG 
GCTTTTCCAGTTGTGACCCGTTTTCCAACCAGTTCTGCAGCATATTAGATTCTAGACAAGCAACACCCCTCTGGAGC 
CAGCACAGTGCTCCTCCATATCACCAGTCATACACAGCCTCATTATTAAGGTCTTATTTAATTTCAGAGTGTAAATT 
TTTTCAAGTGCTCATTAGGTTTTATAT^CAAGAAGCTACATTTTTGCCCTTAAGACACTACTTACAGTGTTATGACT 
TGTATACACATATATTGGTATCAAAGGGGATAAAAGCCAATTTGTCTGTTACATTTCCTTTCACGTATTTCTTTTAG 
CAGCACTTCTGCTACTAAAGTTAATGTGTTTACTCTCTTTCCTTCCCACATTCTCAATTAAAAGGTGAGCTAAGCCT 
CCTCGGTGTTTCTGATTAACAGTAAATCCTAAATTCAAACTGTTAAATGACATTTTTATTTTTATGTCTCTCCTTAA 
CTATGAGACACATCTTGTTTTACTGAATTTCTTTCAATATTCCAGGTGATAGATTTTTGTCG 
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FIGURE 3 

GGCACGAGGCGATTCAGGGGAGGGAGCAACTGGAGCCTCAGGCCCTCCAGAGTAGTCTGCCTGACCACCCTGGAGCC 
CACAGAAGCCCAGGACGTCTCCCGCGAGGCCTCCCCGTGTGTGGCTGAGG ATG GCTGAGCAGCAGGGCCGGGAGCTT 

GAGGCTGAGTGCCCCGTCTGCTGGAACCCCTTCAACAACACGTTCCATACCCCCAAAATGCTGGATTGCTGCCACTC 
CTTCTGCGTGGAATGTCTGGCCCACCTCAGCCTTGTGACTCCAGCCCGGCGCCGCCTGCTGTGCCCACTCTGTCGCC 
AGCCCACAGTGCTGGCCTCAGGGCAGCCTGTCACTGACTTGCCCACGGACACTGCCATGCTCACCCTGCTCCGCCTG 
GAGCCCCACCATGTCATCCTGGAAGGCCATCAGCTGTGCCTCAAGGACCAGCCCAAGAGCCGCTACTTGCTGCGCCA 
GCCTCGAGTCTACACGCTGGACCTTGGCCCCCAGCCTGGGGGCCAGACTGGGCCGCCCCCAGACACGGCCTCTGCCA 
CCGTGTCTACGCCCATCCTCATCCCCAGCCACCACTCTTTGAGGGAGTGTTTCCGCAACCCTCAGTTCCGCATCTTT 
GCCTACCTGATGGCCGTCATCCTCAGTGTCACTCTGTTGCTCATATTCTCCATCTTTTGGACCAAGCAGTTCCTTTG 
GGGTGTGGGG TGA GTGCTGTTCCCAGACAAGAAACCAAACCTTTTTCGGTTGCTGCTGGGTATGGTGACTACGGAGC- 
CTCATTTGGTATTGTCTTCCTTTGTAGTGTTGTTTATTTTACAATCCAGGGATTGTTCAGGCCATGTGTTTGCTTCT 
GGGAACAATTTAAAAAAAAAACAAAAAAACGAAAAGCTTGAAGGACTGGGAGATGTGGAGCGACCTCCGGGTGTGAG 
TGTGGCGTCATGGAAGGGCAGAGAAGCGGTTCTGACCACAGAGCTCCACAGCAAGTTGTGCCAAAGGGCTGCACAGT 
GGTATCCAGGAACCTGACTAGCCCAAATAGCAAGTTGCATTTCTCACTGGAGCTGCTTCAAAATCAGTGCATATTTT 
TTTGAGTTGCTCTTTTACTATGGGTTGCTAAAAAAAAAAAAAAAAAAATTGGGAAGTGAGCTTCAATTCTGTGGGTA 
AATGTGTGTTTGTTTCTCTTTGAATGTCTTGCCACTGGTTGCAGTAAAAGTGTTCTGTATTCATTAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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GAAGCGCGCTCCCGGGGAGGTGTTGCAGCC ATG GCTACGGCAGCCGGCGCGACCTACTTTCAGCGAGGCAGTCTGTT 
CTGGTTCACAGTCATCACCCTCAGCTTTGGCTACTACACATGGGTTGTCTTCTGGCCTCAGAGTATCCCTTATCAGA 
ACCTTGGGCCCCTGGGCCCCTTCACTCAGTACTTGGTGGACCACCATCACACCCTCCTTTGCAATGGGTATTGGCTT 
GCCTGGCTGATTCATGTGGGAGAGTCCTTGTATGCCATAGCATTGTGCAAGCAT7W^GGCATCACi\AGTGGTCGGGC 
TCAGCTACTCTGGTTCCTACAGACTTTCTTCTTTGGGATAGCGTCTCTCACCATCTTGATTGCTTACAAACGGAAGC 
GCCAAAAACAAACTTGAAGTTGTCTGAAAGCTTGCTCTACACTTTTACATTCATCCTCACCCTTTTTTTTGTGGGGT 
AGAGGAGGTGCAGTAATTTACTCAGTGATCTTTCTACTTTCTAGAAACTGTCCTTCAAAGCTCTTTAAGACCCCCTC 
GTTAGTCAGTTTCTTCTCTTATATGCTCTGGTTGAGCTTGAATAGACCAGTTGTTACTTAAGAAAGAAACAGAGAAA 
GATTTTAGCTTTTCAATCCTATTTGGCAGAGGACTTCAGCTACCTTCTTACAGTCTTTGGCTGTGTTGGTACCCTCG 
TGTGCTCTGAGCTAAGCCACATACTAAACTGACTTTTTGGTTTGTATACCCTTGCTCCCGCCTTCTGATGAAAACAC 
CTTACCCTCACAACCACCATCTTTCCTCTCCTTTCCAAAGCTCTTTCCACCTTGCTGCACTAAGATAAAGTGACACT 
TCCACTATATGTCAATTCCACACACATTTATTAGGTACCTGTGAGGTAGGATCCTATCCTCTCAAACTTCCATTTCT 
CATGCTACAGAGAAAGATAAGGAAGATGAGCAAGTGCCTGGAATGGGGCAGGCTGAGCAGTCACACAGGCATAGAGG 
CACGCTGAGAACCTGGAGGGGAGACTGCAGAGTGCCTTCCCTGATGCTGCAGCCGGAAGTGATCCTTCCCTCCACCT 
GGCCCCTGGGACACTGTGCTCTGCAGTGTGCAGGGCCTGATGGCACTGCTAGATTGCTCCTTCAGCTCAGGGCCACA 
GCTTAAACAGCTTTACCTTTCCCCTCAGCACCTGTCCCACTATCTTGCACACAGGTGCTCTAACCATGTTTATTGAA 
CAAAGGAGGGAAACTGATTTCACTTTCACTTGTTCATTATCATTCCAATTTTTATGTGAAAATGGCACAACCCATTT 
GGGGTACCCTCACCCCAAAATAAAAGCCCAAGTCTACCTTTGACTGGTACCACCTTTTTTGTGGTTTCGTTGGTGAG 
AAACCTTTATCTTTTTCATACCTTTCTATTCTCAATCACTTCTCCAAAAGTGTGTCTTTCCAGCTCTGATTTATTCA 
AAACACAAGCATTTCTGTTTAGAGATTCTAGCCCATGGGTTATCTGGCTAGTTATTACCTCTCCTGTTCACTTAGTT 
ATACTTTATTATTGCTCACAGGCTGGGGAGGCAGAATGACTCTGTCACCACTAGGAGCCATTAGGGCTTCTTCCCTG 
GAGGACTGCCTGCTTGCTTTCTGGGGACACTAGCCCTCATTTCCCTTCTGTGGTACAGTGGGGCAAATTATTTGTAT 
TAAGCAAACATTTATGGGAAACAACCCGCTCCCGAAAACGGAGCCCCCAAGTAAAGCACAACCCTGTU^GATTATGA 
ACTATGAATTGTCTCTAGTAGAGATAAATTTCTGCAAACATATCTCAGTCTTCCCTCTGTTTCTCTGGTGATTAAGA 
AGTTCCTTTTTGGTAAGGAAAAGGATTTTTAACCATAGAGTTAGGCATCATGGAAATTCAAACCAGATTTCTTAATA 
CCTGGTCTTCCTCAAAGAGAAATAATAACAGTAATAGTGGTGCTGGGAACAATATGGCAGATTATTGAATGAAATTG 
ATTAACTTGAATAAAATGCTGTGAATTTTC 
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FIGURE 5 

ggcacgaggccgcagcggactgccctttcccaag atg gcgtcgaagataggttcgagacggtggatgttgcagctga 
tcatgcagttgggttcggtgctgctcacacgctgccccttttggggctgcttcagccagctcatgctgtacgctgag 
agggctgaggcacgccggaagcccgacatcccagtgccttacctgtatttcgacatgggggcagccgtgctgtgcgc 
tagtttcatgtcctttggcgtgaagcggcgctggttcgcgctgggggccgcactccaattggccattagcacctacg 
ccgcctacatcgggggctacgtccactacggggactggctgaaggtccgtatgtactcgcgcacagttgccatcatc 
ggcggctttcttgtgttggccagcggtgctggggagctgtaccgccggaaacctcgcagccgctccctgcagtccac 
cggccaggtgttcctgggtatctacctcatctgtgtggcctactcactgcagcacagcaaggaggaccggctggcgt 
atctgaaccatctcccaggaggggagctgatgatccagctgttcttcgtgctgtatggcatcctggccctggccttt 
ctgtcaggctactacgtgaccctcgctgcccagatcctggctgtactgctgccccctgtcatgctgctcattgatgg 
caatgttgcttactggcacaacacgcggcgtgttgagttctggaaccagatgaagctccttggagagagtgtgggca 
tcttcggaactgctgtcatcctggccactgatggc tga gttttatggcaagaggctgagatgggcacagggagccac 
tgagggtcaccctgccttcctccttgctggcccagctgctgtttatttatgctttttggtctgtttgtttgatcttt 
tgcttttttaaaattgttttttgcagttaagaggcagctcatttgtccaaatttctg'ggctcagcgcttgggagggc 
aggagccctggcactaatgctgtacaggtttttttcctgttaggagagctgaggccagctgcccactgagtctcctg 
tccctgagaagggagtatggcagggctgggatgcggctactgagagtgggagagtgggagacagaggaaggaagatg 
gagattggaagtgagcaaatgtgaaaaattcctctttgaacctggcagatgcagctaggctctgcagtgctgtttgg 
agactgtgagagggagtgtgtgtgttgacacatgtggatcaggcccaggaagggcacaggggctgagcactacagaa 
gtcacatgggttctcagggtatgccaggggcagaaacagtaccggctctctgtcactcaccttgagagtagagcaga 
ccctgttctgctctgggctgtgaaggggtggagcaggcagtggccagctttgcccttcctgctgtctctgtttctag 
ctccatggttggcctggtgggggtggagttccctcccaaacaccagaccacacagtcctccaaaaataaacatttta 
tatagacaaaaaaaaaaaaaaaaaaaaaaaaa 
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GGCCGCGGCTCGCCTTTGGCCCTTCTTATCAGGATGAAAACGCTTCTGTTTGGTGTCTGGGCCCTGCTGGCCTTGAT 
CCTTTGCCCAGGGGTCCCGGAAGAGTTGTTTGAGGTTTCTATTTGGCCAAGTCAGGCCCTGGTGGAGTTTGGACAGT 
CCCTAGTGTGCAACTGCAGCACTACTTGCCCAGACCCAGGACCCAGTGGAATTGAGACCTTCTTAAAGAAAACTCAG 
GTGGACAAAGGGCCTCAGTGGAAAGAGTTTCTTCTGGAGGATGTCACAGAGT^TTCCATCCTGCAGTGCTTCTTCTC 
TTGTGCAGGGATTCAAAAGGACACAAGCCTTGGCATCACTGTGTATCAGCCACCAGAGCAAGTGATCCTGGAGCTGC 
AGCCTGCCTGGGTGGCCGTGGACGAAGCCTTCACAGTGAAGTGTCATGTACCCAGTGTAGCACCCTTGGAGAGTCTC 
ACCCTTGCCCTTCTCCAGGGTAACCAAGAACTGCATAGAAAGAACTTTACGAGCTTGGCTGTGGCCTCCCAAAGAGC 
TGAAGTCATCATCAGTGTCAGAGCCCAAAAGGAGAATGACAGATGCAATTCTTCCTGCCATGCAGAACTGGACTTGA 
GTTTGCAAGGTGGGAGGCTCTTTCAAGGCAGCTCACCCATCAGAATAGTCCGGATCTTTGAATTCTCTCAGAGTCCC 
CACATCTGGGTCTCTTCCCTTTTGGAGGCTGGGATGGCGGAGACTGTGAGCTGCGAGGTGGCTAGGGTGTTTCCAGC 
CAAAGAAGTTATGTTCCACATGTTCCTGGAAGACCAAGAGCTGAGCTCCTCCCTTTCCTGGGAGGGGGACACAGCAT 
GGGCCAATGCTACCATTCGGACCATGGAGGCTGGTGATCAGGAACTGTCTTGCTTTGCATCTCTGGGTGCAATGGAA 
CAGAAGACAAGAAAGCTAGTGCATAGCTACAGCTTCCCTCCACCAATCCTGGAGCTAAAAGAATCATACCCATTGGC 
AGGGACCGACATTAATGTGACCTGCTCAGGGCATGTATTAACATCACCCAGCCCTACTCTTCGGCTTCAGGGAGCCC 
CAGACCTCCCTGCTGGGGAGCCTGCCTGGCTTCTACTTACTGCCAGGGAGGAAGATGATGGCXGAAATTTCTCCTGC 
GAGGCCTCTTTGGTGGTGCAGGGTCAGCGGTTGATGAAAACCACTGTGATCCAGCTCCATATCCTAAAGCCACAGTT 
AGAGGAATCCAGTTGCCCTGGCAAACAGACCTGGCTGGAAGGGATGGAACACACGCTCGCCTGCGTCCCAAAGGGAA 
ACCCAGCTCCAGCCTTGGTGTGTACCTGGAATGGGGTGGTCTTTGACCTTGAAGTGCCACAGAAGGCAACC TAGA AC 
CACACTGGAACCTACCGCTACACAGCCACTAACCAGCTGGGCTCTGTCAGCAAAGACATTGCTGT 



7/47 

FIGURE 7 



CCACGCGTCCGTTCTGAGGTGCATTCTTTTTTTGATGAGAGGCATCTCTAGGTACCATCCCTGACCTGGTCCTC ATG 
CTGCCGAGGCTGTTGCTGTTGATCTGTGCTCCACTCTGTGAACCTGCCGAGCTGTTTTTGATAGCCAGCCCCTCCCA 

TCCCACAGAGGGGAGCCCAGTGACCCTGACGTGTAAGATGCCCTTTCTACAGAGTTCAGATGCCCAGTTCCAGTTCT 
GCTTTTTCAGAGACACCCGGGCCTTGGGCCCAGGCTGGAGCAGCTCCCCCAAGCTCCAGATCGCTGCCATGTGGAAA 
GAAGACACAGGGTCATACTGGTGCGAGGCACAGACAATGGCGTCCAAAGTCTTGAGGAGCAGGAGATCCCAGATAAA 
TGTGCACATCCCGGTGTCTCGCCCAATCCTCATGCTCAGGGCTCCCAGGGCCCAGGCTGCAGTGGAGGATGTGCTGG 
AGCTTCACTGTGAGGCCCTGAGAGGCTCTCCTCCAATCCTGTACTGGTTTTATCACGAGGATATCACCCTGGGGAGC 
AGGTCGGCCCCCTCTGGAGGAGGAGCCTCCTTCAACCTTTCCCTGACTGAAGAACATTCTGGAAACTACTCCTGTGA 
GGCCAACAATGGCCTGGGGGCCCAGCGCAGTGAGGCGGTGACACTCAACTTCACAGTGCCTACTGGGGCCAGAAGCA 
ATCATCTTACCTCAGGAGTCATTGAGGGGCTGCTCAGCACCCTTGGTCCAGCCACCGTGGCCTTATTATTTTGCTAC 
GGCCTCAAAAGAAAAATAGGAAGACGTTCAGCCAGGGATCCACTCAGGAGCCTTCCAGCCTTACCCCAAGAGTTCAC 
CTACCTCAACTCACCTACCCCAGGGCAGCTACAGCCTATATATGAAAATGTGAATGTTGTAAGTGGGGATGAGGTTT 
ATTCACTGGCGTACTATAACCAGCCGGAGCAGGAATCAGTAGCAGCAGAAACCCTGGGGACACATATGGAGGACAAG 
GTTTCCTTAGACATCTATTCCAGGCTGAGGAAAGCAAACATTACAGATGTGGACTATGAAGATGCTATG TAA GGTTA 
TGGAAGATTCTGCTCTTTGAAAACCATCCATGACCCCAAGCCTCAGGCCTGATATGTTCTTCAGAGATCCTGGGGCA 
TTAGCTTTCCAGTATACCTCTTCTGGATGCCATTCTCCATGGCACTATTCCTTCATCTACTGTGAAGTGAAGTTGGC 
GCAGCCCTGAAGAAACTACCTAGGAGAACTAATAGACACAGGAGTGACAGGGACTTTGTTATCAGAACCAGATTCCT 
GCCGGCTCCTTTGAAAACAGGTCATATTGTGCTCTTCTGTTTACAAGAGGAAACAAGATGGAATAAAAGAAATTGGG 
ATCTTGGGTTGGAGGGACAGTGAAGCTTAGAGCACATGAACTCAAGGTTAGTGACTCTGCAGGACTTCACAGAGAGA 
GCTGTGCCCATCATTCAGTCCAAGTGCTTTCTCTGCCCAGACAGCACAGAACTCCAGCCCCGCTACTTACATGGATC 
ATCGAGTTTCCACCTAAAATATGATTCTATTTATTTTGAGTCACTGTTACCAAATTAGAACTAAAACAAAGTTACAT 
AAAAAGTTATTGTGACTCCACTTAATTTTAGTGACGTATTTTTGTATATATAGGCCAACCTATACCACATCCAAAAT 
TATGTATCTATTACAGCCCCTAGAAGCTTTATAAATACAGTGTGTCTTCTTTTATTCACAAAATTTTTGAAATCGTG 
GTAATATGGTTTGAAACCTGTATCTTAATTATTTTTTTTTTAAATTGAGACAGGGTCTCACTCTGTCACTCAATCTG 
GAATGCAGTGGCACAATCTTGCCTCACTGCAACGCCTGCCTCTCAGGCTCAAGCAAACCTCTCACCTCAGCCTGCTG 
AGTAGCTGGGACTACAGGCACATGCCACCAAACTTGGCCATTTTTTGTCTTACGTAGAGACAAGATTTCACCGTTTT 
GCCCAGGCTGGTCTCAAACTCCTGGGCTCAAGCAATGTATTGAATTTTAAAATAACCAGGCACTCACTCTTATGAAT 
TAATAAACATTTGGAGGTATATAAAGTAAAAAGTTAAAGTCTTTCCTGTAAGTTAACACAAATGTTAACTATTGTTA 
AAAACTTTACAGGTAGCTCTCTAGATATTTTTCTATTTTTGTATGTATACTTATGCATACATGTAAGTATATAAACA 
TTTAGAAGTGTACCTATCTAACAAACTATTATGAAATACTTTCAAATCTGTAAATAGATCTATTATACTATTTTAAA 
AGTCTCTATAGTAGTGTGTTATATAGATAAATCATAACTTTTTTCTTTTTTTATTGTAGTAAATATGCACAACATAA 
AATTGATCATTTTAACCATTTTTAAGTGTACAATTCAGTGGCATTAAGTACTATCATAATATATTTTAATCCTTCTC 
ATCACTGGTGGACATTAAGGAGACTCTCAAAAAATTCATATTATAAAAACAAAGTTCAAACAAATGTCTTTGTACTA 
GCATATTATGGCACTCCTGCTGGATTATCTGAAGGATAAATTTGTAAATCTAGTATTGCTAGATTATGCATATTAAA 
TATTCTTGTTAAATAGTCAAAAAAAAAAAAAAAA 
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FIGURE 8 

CTCAATCAGCTTTATGCAGAGAAGAAGCTTACTGAGCTCACTGCTGGTGCTGGTGTAGGCAAGTGCTGCTTTGGCAA 
TCTGGGCTGACCTGGCTTGTCTCCTCAGAACTCCTTCTCCAACCCTGGAGCAGGCTTCCATGCTGCTGTGGGCGTCC 
TTGCTGGCCTTTGCTCCAGTCTGTGGACAATCTGCAGCTGCACACAAACCTGTGATTTCCGTCCATCCTCCATGGAC 
CACATTCTTCAAAGGAGAGAGAGTGACTCTGACTTGCAATGGATTTCAGTTCTATGCAACAGAGAAAACAACATGGT 
ATCATCGGCACTACTGGGGAGAAAAGTTGACCCTGACCCCAGGAAACACCCTCGAGGTTCGGGAATCTGGACTGTAC 
AGATGCCAGGCCCGGGGCTCCCCACGAAGTAACCCTGTGCGCTTGCTCTTTTCTTCAGACTCCTTAATCCTGCAGGC 
ACCATATTCTGTGTTTGAAGGTGACACATTGGTTCTGAGATGCCACAGAAGAAGGAAAGAGAAATTGACTGCTGTGA 
AATATACTTGGAATGGAAACATTCTTTCCATTTCTAATAAAAGCTGGGATCTTCTTATCCCACAAGCAAGTTCAAAT 
AACAATGGCAATTATCGATGCATTGGATATGGAGATGAGAATGATGTATTTAGATCAAATTTCAAAATAATTAAAAT 
TCAAGAACTATTTCCACATCCAGAGCTGAAAGCTACAGACTCTCAGCCTACAGAGGGGAATTCTGTAAACCTGAGCT 
GTGAAACACAGCTTCCTCCAGAGCGGTCAGACACCCCACTTCACTTCAACTTCTTCAGAGATGGCGAGGTCATCCTG 
TCAGACTGGAGCACGTACCCGGAACTCCAGCTCCCAACCGTCTGGAGAGAAAACTCAGGATCCTATTGGTGTGGTGC 
TGAAACAGTGAGGGGTAACATCCACAAGCACAGTCCCTCGCTACAGATCCATGTGCAGCGGATCCCTGTGTCTGGGG 
TGCTCCTGGAGACCCAGCCCTCAGGGGGCCAGGCTGTTGAAGGGGAGATGCTGGTCCTTGTCTGCTCCGTGGCTGAA 
GGCACAGGGGATACCACATTCTCCTGGCACCGAGAGGACATGCAGGAGAGTCTGGGGAGGAAAACTCAGCGTTCCCT 
GAGAGCAGAGCTGGAGCTCCCTGCCATCAGACAGAGCCATGCAGGGGGATACTACTGTACAGCAGACAACAGCTACG 
GCCCTGTCCAGAGCATGGTGCTGAATGTCACTGTGAGAGAGACCCCAGGCAACAGAGATGGCCTTGTCGCCGCGGGA 
GCCACTGGAGGGCTGCTCAGTGCTCTTCTCCTGGCTGTGGCCCTGCTGTTTCACTGCTGGCGTCGGAGGAAGTCAGG 
AGTTGGTTTCTTGGGAGACGAAACCAGGCTCCCTCCCGCTCCAGGCCCAGGAGAGTCCTCCCATTCCATCTGCCCTG 
CCCAGGTGGAGCTTCAGTCGTTGTATGTTGATGTACACCCCAAAAAGGGAGATTTGGTATACTCTGAGATCCAGACT 
ACTCAGCTGGGAGAAGAAGAGGAAGCTAATACCTCCAGGACACTTCTAGAGGATAAGGATGTCTCAGTTGTCTACTC 
TGAGGTAAAGACACAACACCCAGATAACTCAGCTGGAAAGATCAGCTCTAAGGATGAAGAAAGTTAAGAGAATGAAA 
AGTTACGGGAACGTCCTACTCATGTGATTTCTCCCTTGTCCAAAGTCCCAGGCCCAGTGCAGTCCTTGCGGCACCTG 
GAATGATCAACTCATTCCAGCTTTCTAATTCTTCTCATGCATATGCATTCACTCCCAGGAATACTCATTCGTCTACT 
CTGATGTTGGGATGGAATGGCCTCTGAAAGACTTCACTAAAATGACCAGGATCCACAGTTAAGAGAAGACCCTGTAG 
TATTTGCTGTGGGCCTGACCTAATGCATTCCCTAGGGTCTGCTTTAGAGAAGGGGGATAAAGAGAGAGAAGGACTGT 
TATGAAAAACAGAAGCACAAATTTTGGTGAATTGGGATTTGCAGAGATGAAAAAGACTGGGTGACCTGGATCTCTGC 
TTAATACATCTACAACCATTGTCTCACTGGAGACTCACTTGCATCAGTTTGTTTAACTGTGAGTGGCTGCACAGGCA 
CTGTGCAAACAATGAAAAGCCCCTTCACTTCTGCCTGCACAGCTTACACTGTCAGGATTCAGTTGCAGATTAAAGAA 
CCCATCTGGAATGGTTTACAGAGAGAGGAATTTAAAAGAGGACATCAGAAGAGCTGGAGATGCAAGCTCTAGGCTGC 
GCTTCCAAAAGCAAATGATAATTATGTTAATGTCATTAGTGACAAAGATTTGCAACATTAGAGAAAAGAGACACAAA 
TATAAAATTAAAAACTTAAGTACCAACTCTCCAAAACTAAATTTGAACTTAAAATATTAGTATAAACTCATAATAAA 
CTCTGCCTTTAAAAAAAGATAAATATTTCCTACGTCTGTTCACTGAAATAATTACCAACCCCTTAGCAATAAGCACT 
CCTTGCAGAGAGGTTTTATTCTCTAAATACCATTCCCTTCTCAAAGGAAATAAGGTTGCTTTTCTTGTAGGAACTGT 
GTCTTTGAGTTACTAATTAGTTTATATGAGAATAATTCTTGCAATAAATGAAGAAGGAATAAAAGAAATAGGAAGCC 
ACAAATTTGTATGGATATTTCATGATACACCTACTGGTTAAATAATTGACAAAAACCAGCAGCCAAATATTAGAGGT 
CTCCTGATGGAAGTGTACAATACCACCTACAAATTATCCATGCCCCAAGTGTTAAAACTGAATCCATTCAAGTCTTT 
CTAACTGAATACTTGTTTTATAGAAAATGCATGGAGAAAAGGAATTTGTTTAAATAACATTATGGGATTGCAACCAG 
CAAAACATAAACTGAGAAAAAGTTCTATAGGGCAAATCACCTGGCTTCTATAACAAATAAATGGGAAAAAAATGAAA 
TAAAAAGAAGAGAGGGAGGAAGAAAGGGAGAGAGAAGAAAAGAAAAATGAAGAAAAGTAATTAGAATATTTTCAACA 
TAAAGAAAAGACGTVATATTTAAGGTGACAGATATCCCAACTACGCTGATTTGATCTTTACAAATTATATGAGTGTAT 
GAATTTGTCACATGTATCACCCCCAAAAAAAGAGAAAAAGAAAAATAGAAGACATATAAATTAAATGAGACGAGACA 
TGTCGACCAAAAGGAATGTGTGGGTCTTGTTTGGATCCTGACTCAAATTAAGAAAAAATAAAACTACCTACGAAATA 
CTAAGAAAAATTTGTATACTAATATTAAGAAATTGTTGTGTGTTTTGGATATAAGTGATAGTTTATTGTAGTGATGT 
TTTTATAAAAGCAAAAGGATATTCACTTTCAGCGCTTATACTGAAGTATTAGATTAAAGCTTATTAACGTA 
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GCCGAGCTGAGCGGATCCTCACATGACTGTGATCCGATTCTTTCCAGCGGCTTCTGCAACCAAGCGGGTCTTACCCC 
CGGTCCTCCGCGTCTCCAGTCCTCGCACCTGGAACCCCAACGTCCCCGAGAGTCCCCGAATCCCCGCTCCCAGGCTA 
CCTAAGAGGATGAGCGGTGCTCCGACGGCCGGGGCAGCCCTGATGCTCTGCGCCGCCACCGCCGTGCTACTGAGCGC 
TCAGGGCGGACCCGTGCAGTCCAAGTCGCCGCGCTTTGCGTCCTGGGACGAGATGAATGTCCTGGCGCACGGACTCC 
TGCAGCTCGGCCAGGGGCTGCGCGAACACGCGGAGCGCACCCGCAGTCAGCTGAGCGCGCTGGAGCGGCGCCTGAGC 
GCGTGCGGGTCCGCCTGTCAGGGAACCGAGGGGTCCACCGACCTCCCGTTAGCCCCTGAGAGCCGGGTGGACCCTGA 
GGTCCTTCACAGCCTGCAGACACAACTCAAGGCTCAGAACAGCAGGATCCAGCAACTCTTCCACAAGGTGGCCCAGC 
AGCAGCGGCACCTGGAGAAGCAGCACCTGCGAATTCAGCATCTGCAAAGCCAGTTTGGCCTCCTGGACCACAAGCAC 
CTAGACCATGAGGTGGCCAAGCCTGCCCGAAGAAAGAGGCTGCCCGAGATGGCCCAGCCAGTTGACCCGGCTCACAA 
TGTCAGCCGCCTGCACCGGCTGCCCAGGGATTGCCAGGAGCTGTTCCAGGTTGGGGAGAGGCAGAGTGGACTATTTG 
AAATCCAGCCTCAGGGGTCTCCGCCATTTTTGGTGAACTGCAAGATGACCTCAGATGGAGGCTGGACAGTAATTCAG 
AGGCGCCACGATGGCTCAGTGGACTTCAACCGGCCCTGGGAAGCCTACAAGGCGGGGTTTGGGGATCCCCACGGCGA 
GTTCTGGCTGGGTCTGGAGAAGGTGCATAGCATCACGGGGGACCGCAACAGCCGCCTGGCCGTGCAGCTGCGGGACT 
GGGATGGCAACGCCGAGTTGCTGCAGTTCTCCGTGCACCTGGGTGGCGAGGACACGGCCTATAGCCTGCAGCTCACT 
GCACCCGTGGCCGGCCAGCTGGGCGCCACCACCGTCCCACCCAGCGGCCTCTCCGTACCCTTCTCCACTTGGGACCA 
GGATCACGACCTCCGCAGGGACAAGAACTGCGCCAAGAGCCTCTCTGGAGGCTGGTGGTTTGGCACCTGCAGCCATT 
CCAACCTCAACGGCCAGTACTTCCGCTCCATCCCACAGCAGCGGCAGAAGCTTAAGAAGGGAATCTTCTGGAAGACC 
TGGCGGGGCCGCTACTACCCGCTGCAGGCCACCACCATGTTGATCCAGCCCATGGCAGCAGAGGCAGCCTCCTAGCG 
TCCTGGCTGGGCCTGGTCCCAGGCCCACGAAAGACGGTGACTCTTGGCTCTGCCCGAGGATGTGGCCGTTCCCTGCC 
TGGGCAGGGGCTCCAAGGAGGGGCCATCTGGAAACTTGTGGACAGAGAAGAAGACCACGACTGGAGAAGCCCCCTTT 
CTGAGTGCAGGGGGGCTGCATGCGTTGCCTCCTGAGATCGAGGCTGCAGGATATGCTCAGACTCTAGAGGCGTGGAC 
CAAGGGGCATGGAGCTTCACTCCTTGCTGGCCAGGGAGTTGGGGACTCAGAGGGACCACTTGGGGCCAGCCAGACTG 
GCCTCAATGGCGGACTCAGTCACATTGACTGACGGGGACCAGGGCTTGTGTGGGTCGAGAGCGCCCTCATGGTGCTG 
GTGCTGTTGTGTGTAGGTCCCCTGGGGACACAAGCAGGCGCCAATGGTATCTGGGCGGAGCTCACAGAGTTCTTGGA 
ATAAAAGCAACCTCAGAACAC 
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GATGTGCTCCTTGGAGCTGGTGTGCAGTGTCCTGACTGTAAGATCAAGTCCAAACCTGTTTTGGAATTGAGGAAACT 
TCTCTTTTGATCTCAGCCCTTGGTGGTCCAGGTCTTC ATG CTGCTGTGGGTGATATTACTGGTCCTGGCTCCTGTCA 
GTGGACAGTTTGCAAGGACACCCAGGCCCATTATTTTCCTCCAGCCTCCATGGACCACAGTCTTCCAAGGAGAGAGA 
GTGACCCTCACTTGCAAGGGATTTCGCTTCTACTCACCACAGAAAACAAAATGGTACCATCGGTACCTTGGGAAAGA 
AATACTAAGAGAAACCCCAGACAATATCCTTGAGGTTCAGGAATCTGGAGAGTACAGATGCCAGGCCCAGGGCTCCC 
CTCTCAGTAGCCCTGTGCACTTGGATTTTTCTTCAGAGATGGGATTTCCTCATGCTGCCCAGGCTAATGTTGAACTC 
CTGGGCTCAAGTGATCTGCTCACCTAGGCCTCTCAAAGCGCTGGGATTACAGCTTCGCTGATCCTGCi\AGCTCCACT 
TTCTGTGTTTGAAGGAGACTCTGTGGTTCTGAGGTGCCGGGCAAAGGCGGAAGTAACACTGAATAATACTATTTACA 
AGAATGATAATGTCCTGGCATTCCTTAATAAAAGAACTGACTTCCAAAAAAAAAAAAAAAAAAAAAAAA 



\ 
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CGCCCGCGCGCTGCAGCCCCATCTCCTAGCGGCAGCCCAGGCGCGGAGGGAGCGAGTCCGCCCCGAGGTAGGTCCAG 
GACGGGCGCACAGCAGCAGCCGAGGCTGGCCGGGAGAGGGAGGAAGAGGATGGCAGGGCCACGCCCCAGCCCATGGG 
CCAGGCTGCTCCTGGCAGCCTTGATCAGCGTCAGCCTCTCTGGGACCTTGGCAAACCGCTGCAAGAAGGCCCCAGTG 
AAGAGCTGCACGGAGTGTGTCCGTGTGGATAAGGACTGCGCCTACTGCACAGACGAGATGTTCAGGGACCGGCGCTG 
CAACACCCAGGCGGAGCTGCTGGCCGCGGGCTGCCAGCGGGAGAGCATCGTGGTCATGGAGAGCAGCTTCCT^TCA 
CAGAGGAGACCCAGATTGACACCACCCTGCGGCGCAGCCAGATGTCCCCCCAAGGCCTGCGGGTCCGTCTGCGGCCC 
GGTGAGGAGCGGCATTTTGAGCTGGAGGTGTTTGAGCCACTGGAGAGCCCCGTGGACCTGTACATCCTCATGGACTT 
CTCCAACTCCATGTCCGATGATCTGGACAACCTCAAGAAGATGGGGCAGAACCTGGCTCGGGTCCTGAGCCAGCTCA 
CCAGCGACTACACTATTGGATTTGGCAAGTTTGTGGACAAAGTCAGCGTCCCGCAGACGGACATGAGGCCTGAGAAG 
CTGAAGGAGCCCTGGCCCAACAGTGACCCCCCCTTCTCCTTCAAGAACGTCATCAGCCTGACAGAAGATGTGGATGA 
GTTCCGGAATAAACTGCAGGGAGAGCGGATCTCAGGCAACCTGGATGCTCCTGAGGGCGGCTTCGATGCCATCCTGC 
AGACAGCTGTGTGCACGAGGGACATTGGCTGGCGCCCGGACAGCACCCACCTGCTGGTCTTCTCCACCGAGTCAGCC 
TTCCACTATGAGGCTGATGGCGCCAACGTGCTGGCTGGCATCATGAGCCGCAACGATGAACGGTGCCACCTGGACAC 
CACGGGCACCTACACCCAGTACAGGACACAGGACTACCCGTCGGTGCCCACCCTGGTGCGCCTGCTCGCCAAGCACA 
ACATCATCCCCATCTTTGCTGTCACCAACTACTCCTATAGCTACTACGAGAAGCTTCACACCTATTTCCCTGTCTCC 
TCACTGGGGGTGCTGCAGGAGGACTCGTCCAACATCGTGGAGCTGCTGGAGGAGGCCTTCAATCGGATCCGCTCCAA 
CCTGGACATCCGGGCCCTAGACAGCCCCCGAGGCCTTCGGACAGAGGTCACCTCCAAGATGTTCCAGAAGACGAGGA 
CTGGGTCCTTTCACATCCGGCGGGGGGAAGTGGGTATATACCAGGTGCAGCTGCGGGCCCTTGAGCACGTGGATGGG 
ACGCACGTGTGCCAGCTGCCGGAGGACCAGAAGGGCAACATCCATCTGAAACCTTCCTTCTCCGACGGCCTCAAGAT 
GGACGCGGGCATCATCTGTGATGTGTGCACCTGCGAGCTGCAAAAAGAGGTGCGGTCAGCTCGCTGCAGCTTCAACG 
GAGACTTCGTGTGCGGACAGTGTGTGTGCAGCGAGGGCTGGAGTGGCCAGACCTGCAACTGCTCCACCGGCTCTCTG 
AGTGACATTCAGCCCTGCCTGCGGGAGGGCGAGGACAAGCCGTGCTCCGGCCGTGGGGAGTGCCAGTGCGGGCACTG 
TGTGTGCTACGGCGAAGGCCGCTACGAGGGTCAGTTCTGCGAGTATGACAACTTCCAGTGTCCCCGCACTTCCGGGT 
TCCTCTGCAATGACCGAGGACGCTGCTCCATGGGCCAGTGTGTGTGTGAGCCTGGTTGGACAGGCCCAAGCTGTGAC 
TGTCCCCTCAGCAATGCCACCTGCATCGACAGCAATGGGGGCATCTGTAATGGACGTGGCCACTGTGAGTGTGGCCG 
CTGCCACTGCCACCAGCAGTCGCTCTACACGGACACCATCTGCGAGATCAACTACTCGGCGATCCACCCGGGCCTCT 
GCGAGGACCTACGCTCCTGCGTGCAGTGCCAGGCGTGGGGCACCGGCGAGAAGAAGGGGCGCACGTGTGAGGAATGC 
AACTTCAAGGTCAAGATGGTGGACGAGCTTAAGAGAGCCGAGGAGGTGGTGGTGCGCTGCTCCTTCCGGGACGAGGA 
TGACGACTGCACCTACAGCTACACCATGGAAGGTGACGGCGCCCCTGGGCCCAACAGCACTGTCCTGGTGCACAAGA 
AGAAGGACTGCCCTCCGGGCTCCTTCTGGTGGCTCATCCCCCTGCTCCTCCTCCTCCTGCCGCTCCTGGCCCTGCTA 
CTGCTGCTATGCTGGAAGTACTGTGCCTGCTGCAAGGCCTGCCTGGCACTTCTCCCGTGCTGCAACCGAGGTCACAT 
GGTGGGCTTTAAGGAAGACCACTACATGCTGCGGGAGAACCTGATGGCCTCTGACCACTTGGACACGCCCATGCTGC 
GCAGCGGGAACCTCAAGGGCCGTGACGTGGTCCGCTGGAAGGTCACCAACAACATGCAGCGGCCTGGCTTTGCCACT 
CATGCCGCCAGCATCAACCCCACAGAGCTGGTGCCCTACGGGCTGTCCTTGCGCCTGGCCCGCCTTTGCACCGAGAA 
CCTGCTGAAGCCTGACACTCGGGAGTGCGCCCAGCTGCGCCAGGAGGTGGAGGAGAACCTGAACGAGGTCTACAGGC 
AGATCTCCGGTGTACACAAGCTCCAGCAGACCAAGTTCCGGCAGCAGCCCAATGCCGGGAAAAAGCAAGACCACACC 
ATTGTGGACACAGTGCTGATGGCGCCCCGCTCGGCCAAGCCGGCCCTGCTGAAGCTTACAGAGAAGCAGGTGGAACA 
GAGGGCCTTCCACGACCTCAAGGTGGCCCCCGGCTACTACACCCTCACTGCAGACCAGGACGCCCGGGGCATGGTGG 
AGTTCCAGGAGGGCGTGGAGCTGGTGGACGTACGGGTGCCCCTCTTTATCCGGCCTGAGGATGACGACGAGAAGCAG 
CTGCTGGTGGAGGCCATCGACGTGCCCGCAGGCACTGCCACCCTCGGCCGCCGCCTGGTAAACATCACCATCATCAA 
GGAGCAAGCCAGAGACGTGGTGTCCTTTGAGCAGCCTGAGTTCTCGGTCAGCCGCGGGGACCAGGTGGCCCGCATCC 
CTGTCATCCGGCGTGTCCTGGACGGCGGGAAGTCCCAGGTCTCCTACCGCACACAGGATGGCACCGCGCAGGGCAAC 
CGGGACTACATCCCCGTGGAGGGTGAGCTGCTGTTCCAGCCTGGGGAGGCCTGGAAAGAGCTGCAGGTGAAGCTCCT 
GGAGCTGCAAGAAGTTGACTCCCTCCTGCGGGGCCGCCAGGTCCGCCGTTTCCACGTCCAGCTCAGCAACCCTAAGT 
TTGGGGCCCACCTGGGCCAGCCCCACTCCACCACCATCATCATCAGGGACCCAGATGAACTGGACCGGAGCTTCACG 
AGTCAGATGTTGTCATCACAGCCACCCCCTCACGGCGACCTGGGCGCCCCGCAGAACCCCAATGCTAAGGCCGCTGG 
GTCCAGGAAGATCCATTTCAACTGGCTGCCCCCTTCTGGCAAGCCAATGGGGTACAGGGTAAAGTACTGGATTCAGG 
GTGACTCCGAATCCGAAGCCCACCTGCTCGACAGCAAGGTGCCCTCAGTGGAGCTCACCAACCTGTACCCGTATTGC 
GACTATGAGATGAAGGTGTGCGCCTACGGGGCTCAGGGCGAGGGACCCTACAGCTCCCTGGTGTCCTGCCGCACCCA 
CCAGGAAGTGCCCAGCGAGCCAGGGCGTCTGGCCTTCAATGTCGTCTCCTCCACGGTGACCCAGCTGAGCTGGGCTG 
AGCCGGCTGAGACCAACGGTGAGATCACAGCCTACGAGGTCTGCTATGGCCTGGTCAACGATGACAACCGACCTATT 
GGGCCCATGAAGAAAGTGCTGGTTGACAACCCTAAGAACCGGATGCTGCTTATTGAGAACCTTCGGGAG 
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TCCCAGCCCTACCGCTACACGGTGAAGGCGCGCAACGGGGCCGGCTGGGGGCCTGAGCGGGAGGCCATCATCAACCT 
GGCCACCCAGCCCAAGAGGCCCATGTCCATCCCCATCATCCCTGACATCCCTATCGTGGACGCCCAGAGCGGGGAGG 
ACTACGACAGCTTCCTTATGTACAGCGATGACGTTCTACGCTCTCCATCGGGCAGCCAGAGGCCCAGCGTCTCCGAT 
GACACTGAGCACCTGGTGAATGGCCGGATGGACTTTGCCTTCCCGGGCAGCACCAACTCCCTGCACAGGATGACCAC 
GACCAGTGCTGCTGCCTATGGCACCCACCTGAGCCCACACGTGCCCCACCGCGTGCTAAGCACATCCTCCACCCTCA 
CACGGGACTACAACTCACTGACCCGCTCAGAACACTCACACTCGACCACACTGCCGAGGGACTACTCCACCCTCACC 
TCCGTCTCCTCCCACGACTCTCGCCTGACTGCTGGTGTGCCCGACACGCCCACCCGCCTGGTGTTCTCTGCCCTGGG 
GCCCACATCTCTCAGAGTGAGCTGGCAGGAGCCGCGGTGCGAGCGGCCGCTGCAGGGCTACAGTGTGGAGTACCAGC 
TGCTGAACGGCGGTGAGCTGCATCGGCTCAACATCCCCAACCCTGCCCAGACCTCGGTGGTGGTGGAAGACCTCCTG 
CCCAACCACTCCTACGTGTTCCGCGTGCGGGCCCAGAGCCAGGAAGGCTGGGGCCGAGAGCGTGAGGGTGTCATCAC 
CATTGAATCCCAGGTGCACCCGCAGAGCCCACTGTGTCCCCTGCCAGGCTCCGCCTTCACTTTGAGCACTCCCAGTG 
CCCCAGGCCCGCTGGTGTTCACTGCCCTGAGCCCAGACTCGCTGCAGCTGAGCTGGGAGCGGCCACGGAGGCCCAAT 
GGGGATATCGTCGGCTACCTGGTGACCTGTGAGATGGCCCAAGGAGGAGGGCCAGCCACCGCATTCCGGGTGGATGG 
AGACAGCCCCGAGAGCCGGCTGACCGTGCCGGGCCTCAGCGAGAACGTGCCCTACAAGTTCAAGGTGCAGGCCAGGA 
CCACTGAGGGCTTCGGGCCAGAGCGCGAGGGCATCATCACCATAGAGTCCCAGGATGGAGGACCCTTCCCGCAGCTG 
GGCAGCCGTGCCGGGCTCTTCCAGCACCCGCTGCAAAGCGAGTACAGCAGCATCACCACCACCCACACCAGCGCCAC 
CGAGCCCTTCCTAGTGGATGGGCCGACCCTGGGGGCCCAGCACCTGGAGGCAGGCGGCTCCCTCACCCGGCATGTGA 
CCCAGGAGTTTGTGAGCCGGACACTGACCACCAGCGGAACCCTTAGCACCCACATGGACCAACAGTTCTTCCAAACT 
TGACCGCACCCTGCCCCACCCCCGCCATGTCCCACTAGGCGTCCTCCCGACTCCTCTCCCGGAGCCTCCTCAGCTAC 
TCCATCCTTGCACCCCTGGGGGCCCAGCCCACCCGCATGCACAGAGCAGGGGCTAGGTGTCTCCTGGGAGGCATGAA 
GGGGGCAAGGTCCGTCCTCTGTGGGCCCAAACCTATTTGTAACCAAAGAGCTGGGAGCAGCACAAGGACCCAGCCTT 
TGTTCTGCACTTAATAAATGGTTTTGCTACTG 
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GCCTTCAACTACCATCCCACCACCTGCTGAGGAGT^AAAATTCTTCAAGACTCAGAGCACACAGCCAGCACCAGAGGC 
CCCATGACCCTGGACAGACCAGGGGAGGGGGCCACCATGCTGAAGACATTCACTGTTTTGCTCTTTTGCATTCGGAT 
GAGTCTGGGTATGACATCGATAGTGATGGACCCTCAACCGGAGTTGTGGATAGAGTCCAACTACCCCCAGGCCCCTT 
GGGAGAACATCACGCTTTGGTGCCGAAGCCCCTCTCGGATATCAAGCAAGTTCCTGCTGCTGAAGGATAAGACACAA 
ATGACCTGGATCCGCCCTTCCCACAAGACCTTCCAAGTTTCATTCCTTATAGGTGCCCTTACTGAGTCCAATGCAGG 
TCTTTACCGGTGCTGCTACTGGAAGGAGACAGGCTGGTCAAAGCCCAGTAAAGTTCTAGAGTTGGAGGCACCAGGCC 
AACTGCCCAAGCCCATCTTCTGGATTCAGGCTGAGACCCCCGCTCTTCCTGGGTGTAATGTTAACATCCTCTGCCAT 
GGCTGGCTGCAGGATTTGGTATTCATGCTGTTTA/U^GAGGGATATGCAGAGCCTGTGGATTACCAAGTCCCAACTGG 
GACAATGGCCATATTCTCCATTGACAACCTGACACCTGAGGATGAAGGGGTTTACATCTGCCGCACTCATATCCAGA 
TGCTCCCCACCCTGTGGTCAGAGCCCAGCAACCCCCTGAAGCTGGTTGTAGCAGGACTCTACCCCAAACCAACTTTG 
ACAGCCCATCCTGGGCCCATCATGGCACCTGGAGAAAGCCTGAATCTCAGGTGCCAAGGGCCAATCTATGGAATGAC 
CTTTGCTCTAATGAGGGTTGAAGACTTGGAGAAGTCCTTTTACCACAAGAAGACAATAAAAAATGAGGCAAATTTCT 
TCTTCCAGTCTTTGAAGATCCAAGATACTGGACATTACCTCTGTTTTTACTATGACGCATCATATAGAGGTTCACTC 
CTTAGTGATGTCCTGAAAATCTGGGTAACTGACACTTTCCCCAAGACCTGGCTACTTGCTCGGCCCAGTGCTGTGGT 
CCAAATGGGTCAGAATGTGAGCCTACGGTGTCGAGGACCAGTGGATGGAGTGGGTCTTGCACTCTATAAGAAAGGAG 
AAGACAAACCACTTCAATTTTTGGATGCCACCAGCATCGATGACAACACATCATTCTTCCTCAACAATGTAACCTAC 
AGTGATACTGGCATCTATAGCTGCCACTATCTTCTCACCTGGAAGACCTCCATTAGGATGCCATCACACAACACTGT 
GGAGCTTATGGTTGTAGATAAGCCCCCCAAACCCTCCCTGTCAGCTTGGCCAAGCACTGTGTTCAAGCTAGGAAAGG 
CCATCACCCTTCAGTGCCGAGTATCTCATCCAGTACTGGAATTTTCTCTGGAATGGGAAGAAAGAGAAACATTCCAA 
AGATTCTCAGTAAACGGAGACTTCATCATCAGTAATGTTGACGGGAAAGGCACAGGGACCTACAGTTGCAGCTATCG 
CGTAGAGACACATCCTAACATGTGGTCACATCGCAGTGAGCCCCTGAAGCTGATGGGGCCAGCAGGCTATCTCACCT 
GGAATTACGTTCTGAATGAAGCTATCAGGTTGTCTCTAATCATGCAGCTTGTTGCCTTGCTGTTGGTAGTGCTGTGG 
ATAAGGTGGAAGTGTCGGAGACTCAGAATCAGAGAAGCCTGGTTGCTGGGAACAGCTCAAGGGGTCACCATGCTCTT 
CATAGTCACGGCCCTTCTCTGCTGTGGACTGTGCAATGGGGTATTGATAGAAGAGACTGAAATAGTCATGCCAACCC 
CTAAGCCTGAGCTGTGGGCAGAGACCAACTTTCCTCTGGCCCCGTGGAAGAACTTAACCCTCTGGTGCAGAAGCCCT 
TCTGGCTCAACTAAGGAGTTTGTGTTGCTGAAGGATGGGACCGGGTGGATCGCCACTCGCCCGGCCTCAGAGCAGGT 
CCGGGCTGCCTTCCCCCTTGGCGCCCTGACCCAGAGCCACACCGGGAGCTACCACTGCCATTCATGGGAGGAGATGG 
CTGTATCGGAGCCCAGTGAGGCACTTGAGCTGGTGGGGACAGACATCCTCCCCAAACCTGTCATTTCTGCTTCCCCC 
ACAATCCGGGGCCAGGAACTACAACTCCGGTGCAAAGGATGGCTGGCAGGCATGGGGTTTGCTCTGTATAAGGAGGG 
AGAGCAAGAACCTGTCCAGCAACTTGGTGCTGTTGGAAGAGAAGCCTTCTTTACAATCCAGAGAATGGAGGATAAAG 
ACGAAGGCAATTACAGCTGCCGCACTCACACTGAAAAACTCCCCTTCAAGTGGTCTGAGCCCAGTGAGCCGCTGGAG 
CTTGTCATAAAAGAAATGTACCCTAAGCCCTTCTTCAAGACATGGGCCAGCCCTGTGGTCACCCCTGGTGCCCGAGT 
GACTTTCAATTGCTCCACCCCCCACCAGCATATGAGCTTTATTCTTTACAAAGATGGAAGTGAAATAGCATCCAGTG 
ACAGGTCCTGGGCAAGTCCGGGGGCCAGTGCAGCTCACTTTCTAATCATTTCGGTGGGCATTGGTGATGGAGGGAAT 
TACAGCTGCCGATATTATGACTTTTCTATCTGGTCTGAGCCCAGCGACCCTGTGGAGCTCGTGGTGACAGAATTCTA 
CCCCAAACCCACTCTCCTGGCACAGCCAGGTCCTGTGGTGTTTCCTGGGAAGAGTGTGATCCTGCGCTGCCAAGGGA 
CTTTCCAGGGCATGAGGTTCGCCCTCTTGCAGGAGGGAGCCCATGTTCCCTTACAGTTTCGGAGTGTCTCAGGGAAC 
TCAGCTGACTTCCTTCTCCACACTGTTGGAGCAGAGGACTCTGGGAACTATAGCTGTATCTACTATGAGACAACCAT 
GTCAAACAGGGGGTCATATCTCAGTATGCCCCTTATGATCTGGGTGACTGACACATTCCCTAAGCCATGGTTGTTTG 
CTGAGCCCAGTTCTGTGGTTCCCATGGGGCAGAATGTTACTCTCTGGTGCCGAGGGCCGGTCCATGGAGTAGGATAC 
ATTCTGCACAAAGAAGGAGAAGCCACTTCAATGCAGCTCTGGGGATCCACCAGTAATGACGGGGCATTCCCCATCAC 
CAATATATCTGGTACTAGCATGGGGCGTTACAGCTGCTGCTACCACCCTGACTGGACCAGTTCTATCAAGATACAAC 
CTAGCAACACCCTGGAACTCCTAGTCACAGGCTTACTCCCCAAACCCAGCCTATTAGCCCAGCCTGGTCCCATGGTG 
GCCCCTGGCGAAAATATGACTCTTCAGTGTCAAGGGGAACTGCCAGACTCAACATTTGTGCTGTTGAAGGAGGGGGC 
TCAGGAGCCTTTAGAGCAACAGAGGCCAAGTGGGTACAGGGCTGACTTCTGGATGCCAGCAGTGAGAGGTGAAGACT 
CTGGGATCTATAGCTGTGTTTATTATTTGGACTCTACTCCCTTTGCAGCTTCAAATCACAGTGACTCCCTGGAGATC 
TGGGTGACTGATAAGCCCCCTAAACCCTCTCTGTCAGCCTGGCCCAGCACCATGTTCAAGTTAGGGAAGGACATCAC 
CCTTCAGTGCCGAGGACCCCTGCCAGGTGTTGAATTTGTTCTAGAACATGATGGAGAAGAAGCACCTCAGCAGTTTT 
CAGAGGATGGAGACTTTGTCATCAACAACGTAGAAGGAAAAGGCATTGGAAACTACAGCTGCAGCTACCGCCTCCAG 
GCCTACCCTGATATCTGGTCAGAGCCTAGTGATCCCCTGGAGCTGGTGGGGGCAGCAGGGCCTGTTGCTCAGGAGTG 
CACTGTAGGGAACATTGTCCGAAGTAGCCTAATCGTGGTGGTTGTTGTAGCCTTGGGGGTAGTGCTAGCCATAGAGT 
GGAAGAAGTGGCCTCGACTGCGAACCAGAGGCTCAGAGACAGACGGAAGAGACCAGACCATTGCCCTTG 



14/47 

FIGURE 12B 

AAGAGTGTAACCAAGAAGGAGAACCAGGCACCCCTGCCAATTCTCCTTCATCAACCTCTCAGAGAATCTCTGTGGAA 
CTGCCCGTTCCAATATAATAATCTCCTCCTTTACAAGAGCTTTCCTCTCCTCTCTCTTGCTCTCAGAGACCTATAAA 
TCCAACCAGTTACCCTGCAAGTCAGCCCCATCTGCTGTTCCTTGGTCTCTAATCACCTGAGCTGGGTAAAGGGGATT 
CTGGGAGTTGAGAGCTCTGCCAGGGTGAGATGTTTCCTGAAGAGAGGTTCCCCACCCCTGTAACTCCTCACTGTACT 
GATTTACTGGCGCATGAAATTCTATTAAAAATGCATTCTTCTGAATAAAAAGAGTATTCACTATTTAACTTCAATTT 
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CGGGAGCGGCGGGAGCGGTGGCGGCGGCAGAGGCGGCGGCTCCAGCTTCGGCTCCGGCTCGGGCTCGGGCTCCGGCT 
CCGGCTCCGGCTCCGGCTCCAGCTCGGGTGGCGGTGGCGGGAGCGGGACCAGGTGGAGGCGGCGGCGGCAGAGGAGT 
GGGAGCAGCGGCCCTAGCGGCTTGCGGGGGGACATGCGGACCGACGGCCCCTGGATAGGCGGAAGGAGTGGAGGCCC 
TGGTGCCCGGCCCTTGGTGCTGAGTATCCAGCAAGAGTGACCGGGGTGAAGAAGCAAAGACTCGGTTGATTGTCCTG 
GGCTGTGGCTGGCTGTGGAGCTAGAGCCCTGGATGGCCCCTGAGCCAGCCCCAGGGAGGACGATGGTGCCCCTTGTG 
CCTGCACTGGTGATGCTTGGTTTGGTGGCAGGCGCCCATGGTGACAGCAAACCTGTCTTCATTAAAGTCCCTGAGGA 
CCAGACTGGGCTGTCAGGAGGGGTAGCCTCCTTCGTGTGCCAAGCTACAGGAGAACCCAAGCCGCGCATCACATGGA 
TGAAGAAGGGGAAGAAAGTCAGCTCCCAGCGCTTCGAGGTCATTGAGTTTGATGATGGGGCAGGGTCAGTGCTTCGG 
ATCCAGCCATTGCGGGTGCAGCGAGATGAAGCCATCTATGAGTGTACAGCTACTAACAGCCTGGGTGAGATCAACAC 
TAGTGCCAAGCTCTCAGTGCTCGAAGAGGAACAGCTGCCCCCTGGGTTCCCTTCCATCGACATGGGGCCTCAGCTGA 
AGGTGGTGGAGAAGGCACGCACAGCCACCATGCTATGTGCCGCAGGCGGAAATCCAGACCCTGAGATTTCTTGGTTC 
AAGGACTTCCTTCCTGTAGACCCTGCCACGAGCAACGGCCGCATCAAGCAGCTGCGTTCAGGTGCCTTGCAGATAGA 
GAGCAGTGAGGAATCCGACCAAGGCAAGTACGAGTGTGTGGCGACCAACTCGGCAGGCACACGTTACTCAGCCCCTG 
CGAACCTGTATGTGCGAGTGCGCCGCGTGGCTCCTCGTTTCTCCATCCCTCCCAGCAGCCAGGAGGTGATGCCAGGC 
GGCAGCGTGAACCTGACATGCGTGGCAGTGGGTGCACCCATGCCCTACGTGAAGTGGATGATGGGGGCCGAGGAGCT 
CACCAAGGAGGATGAGATGCCAGTTGGCCGCAACGTCCTGGAGCTCAGCAATGTCGTACGCTCTGCCAACTACACCT 
GTGTGGCCATCTCCTCGCTGGGCATGATCGAGGCCACAGCCCAGGTCACAGTGAAAGCTCTTCCAAAGCCTCCGATT 
GATCTTGTGGTGACAGAGACAACTGCCACCAGTGTCACCCTCACCTGGGACTCTGGGAACTCGGAGCCTGTAACCTA 
CTATGGCATCCAGTACCGCGCAGCGGGCACGGAGGGCCCCTTTCAGGAGGTGGATGGTGTGGCCACCACCCGCTACA 
GCATTGGCGGCCTCAGCCCTTTCTCGGAATATGCCTTCCGCGTGCTGGCGGTGAACAGCATCGGGCGAGGGCCGCCC 
AGCGAGGCAGTGCGGGCACGCACGGGAGAACAGGCGCCCTCCAGCCCACCGCGCCGCGTGCAGGCACGCATGCTGAG 
CGCCAGCACCATGCTGGTGCAGTGGGAGCCTCCCGAGGAGCCCAACGGCCTGGTGCGGGGATACCGCGTCTACTATA 
CTCCGGACTCCCGCCGCCCCCCGAACGCCTGGCACAAGCACAACACCGACGCGGGGCTCCTCACGACCGTGGGCAGC 
CTGCTGCCTGGCATCACCTACAGCCTGCGCGTGCTTGCCTTCACCGCCGTGGGCGATGGCCCTCCCAGCCCCACCAT 
CCAGGTCAAGACGCAGCAGGGAGTGCCTGCCCAGCCCGCGGACTTCCAGGCCGAGGTGGAGTCGGACACCAGGATCC 
AGCTCTCGTGGCTGCTGCCCCCTCAGGAGCGGATCATCATGTATGAACTGGTGTACTGGGCGGCAGAGGACGAAGAC 
CAACAGCACAAGGTCACCTTCGACCCAACCTCCTCCTACACACTAGAGGACCTGAAGCCTGACACACTCTACCGCTT 
CCAGCTGGCTGCACGCTCGGATATGGGGGTGGGCGTCTTCACCCCCACCATTGAGGCCCGCACAGCCCAGTCCACCC 
CCTCCGCCCCTCCCCAGAAGGTGATGTGTGTGAGCATGGGCTCCACCACGGTCCGGGTAAGTTGGGTCCCGCCGCCT 
GCCGACAGCCGCAACGGCGTTATCACCCAGTACTCCGTGGCCCACGAGGCGGTGGACGGCGAGGACCGCGGGCGGCA 
TGTGGTGGATGGCATCAGCCGTGAGCACTCCAGCTGGGACCTGGTGGGCCTGGAGAAGTGGACGGAGTACCGGGTGT 
GGGTGCGGGCACACACAGACGTGGGCCCCGGCCCCGAGAGCAGCCCGGTGCTGGTGCGCACCGATGAGGACGTGCCC 
AGCGGGCCTCCGCGGAAGGTGGAGGTGGAGCCACTGAACTCCACTGCTGTGCATGTCTACTGGAAGCTGCCTGTCCC 
CAGCAAGCAGCATGGCCAGATCCGCGGCTACCAGGTCACCTACGTGCGGCTGGAGAATGGCGAGCCCCGTGGACTCC 
CCATCATCCAAGACGTCATGCTAGCCGAGGCCCAGTGGCGGCCAGAGGAGTCCGAGGACTATGAAACCACTATCAGC 
GGCCTGACCCCGGAGACCACCTACTCCGTTACTGTTGCTGCCTATACCACCAAGGGGGATGGTGCCCGCAGCAAGCC 
CAAAATTGTCACTACAACAGGTGCAGTCCCAGGCCGGCCCACCATGATGATCAGCACCACGGCCATGAACACTGCGC 
TGCTCCAGTGGCACCCACCCAAGGAACTGCCTGGCGAGCTGCTGGGCTACCGGCTGCAGTACTGCCGGGCCGACGAG 
GCGCGGCCCAACACCATAGATTTCGGCAAGGATGACCAGCACTTCACAGTCACCGGCCTGCACAAGGGGACCACCTA 
CATCTTCCGGCTTGCTGCCAAGAACCGGGCTGGCTTGGGTGAGGAGTTCGAGAAGGAGATCAGGACCCCCGAGGACC 
TGCCCAGCGGCTTCCCCCAAAACCTGCATGTGACAGGACTGACCACGTCTACCACAGAACTGGCCTGGGACCCGCCA 
GTGCTGGCGGAGAGGAACGGGCGCATCATCAGCTACACCGTGGTGTTCCGAGACATCAACAGCCAACAGGAGCTGCA 
GAACATCACGACAGACACCCGCTTTACCCTTACTGGCCTCAAGCCAGACACCACTTACGACATCAAGGTCCGCGCAT 
GGACCAGCAAAGGCTCTGGCCCACTCAGCCCCAGCATCCAGTCCCGGACCATGCCGGTGGAGCAAGTGTTTGCCAAG 
AACTTCCGGGTGGCGGCTGCAATGAAGACGTCTGTGCTGCTCAGCTGGGAGGTTCCCGACTCCTATAAGTCAGCTGT 
GCCCTTTAAGATTCTGTACAATGGGCAGAGTGTGGAGGTGGACGGGCACTCGATGCGGAAGCTGATCGCAGACCTGC 
AGCCCAACACAGAGTACTCGTTTGTGCTGATGAACCGTGGCAGCAGCGCAGGGGGCCTGCAGCACCTGGTGTCCATC 
CGCACAGCCCCCGACCTCCTGCCTCACAAGCCGCTGCCTGCCTCTGCCTACATAGAGGACGGCCGCTTCGATCTCTC 
CATGCCCCATGTGCAAGACCCCTCGCTTGTCAGGTGGTTCTACATTGTTGTGGTACCCATTGACCGTGTGGGCGGGA 
GCATGCTGACGCCAAGGTGGAGCACACCCGAGGAACTGGAGCTGGACGAGCTTCTAGAAGCCATCGAGCAAGGCGGA 
GAGGAGCAGCGGCGGCGGCGGCGGCAGGCAGAACGTCTGAAGCCATATGTGGCTGCTCAACTGGATGTGCTCCCGGA 
GACCTTTACCTTGGGGGACAAGAAGAACTACCGGGGCTTCTACAACCGGCCCCTGTCTCCGGACTTGAGCT 
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ACCAGTGCTTTGTGCTTGCCTCCTTGAAGGAACCCATGGACCAGAAGCGCTATGCCTCCAGCCCCTACTCGGATGAG 
ATCGTGGTCCAGGTGACACCAGCCCAGCAGCAGGAGGAGCCGGAGATGCTGTGGGTGACGGGTCCCGTGCTGGCAGT 
CATCCTCATCATCCTCATTGTCATCGCCATCCTCTTGTTCAAAAGGAAAAGGACCCACTCTCCGTCCTCTAAGGATG 
AGCAGTCGATCGGACTGAAGGACTCCTTGCTGGCCCACTCCTCTGACCCTGTGGAGATGCGGAGGCTCAACTACCAG 
ACCCCAGGTATGCGAGACCACCCAeCCATCCCCATCACCGACCTGGCGGACAACATCGAGCGCCTCAAAGCCAACGA 
TGGCCTCAAGTTCTCCCAGGAGTATGAGTCCATCGACCCTGGACAGCAGTTCACGTGGGAGAATTCAAACCTGGAGG 
TGAACAAGCCCAAGAACCGCTATGCGAATGTCATCGCCTACGACCACTCTCGAGTCATCCTTACCTCTATCGATGGC 
GTCCCCGGGAGTGACTACATCAATGCCAACTACATCGATGGCTACCGCAAGCAGAATGCCTACATCGCCACGCAGGG 
CCCCCTGCCCGAGACCATGGGCGATTTCTGGAGAATGGTGTGGGAACAGCGCACGGCCACTGTGGTCATGATGACAC 
GGCTGGAGGAGAAGTCCCGGGTAAAATGTGATCAGTACTGGCCAGCCCGTGGCACCGAGACCTGTGGCCTTATTCAG 
GTGACCCTGTTGGACACAGTGGAGCTGGCCACATACACTGTGCGCACCTTCGCACTCCACAAGAGTGGCTCCAGTGA 
GAAGGGTGAGCTGCGTCAGTTTCAGTTCATGGCCTGGCCAGACCATGGAGTTCCTGAGTACCCAACTCCCATCCTGG 
CCTTCCTACGACGGGTCAAGGCCTGCAACCCCCTAGACGCAGGGCCCATGGTGGTGCACTGCAGCGCGGGCGTGGGC 
CGCACCGGCTGCTTCATCGTGATTGATGCCATGTTGGAGCGGATGAAGCACGAGAAGACGGTGGACATCTATGGCCA 
CGTGACCTGCATGCGATCACAGAGGAACTACATGGTGCAGACGGAGGACCAGTACGTGTTCATCCATGAGGCGCTGC 
TGGAGGCTGCCACGTGCGGCCACACAGAGGTGCCTGCCCGCAACCTGTATGCCCACATCCAGAAGCTGGGCCAAGTG 
CCTCCAGGGGAGAGTGTGACCGCCATGGAGCTCGAGTTCAAGTTGCTGGCCAGCTCCAAGGCCCACACGTCCCGCTT 
CATCAGCGCCAACCTGCCCTGCAACAAGTTCAAGAACCGGCTGGTGAACATCATGCCCTACGAATTGACCCGTGTGT 
GTCTGCAGCCCATCCGTGGTGTGGAGGGCTCTGACTACATCAATGCCAGCTTCCTGGATGGTTATAGACAGCAGAAG 
GCCTACATAGCTACACAGGGGCCTCTGGCAGAGAGCACCGAGGACTTCTGGCGCATGCTATGGGAGCACAATTCCAC 
CATCATCGTCATGCTGACCAAGCTTCGGGAGATGGGCAGGGAGAAATGCCACCAGTACTGGCCAGCAGAGCGCTCTG 
CTCGCTACCAGTACTTTGTTGTTGACCCGATGGCTGAGTACAACATGCCCCAGTATATCCTGCGTGAGTTCAAGGTC 
ACGGATGCCCGGGATGGGCAGTCAAGGACAATCCGGCAGTTCCAGTTCACAGACTGGCCAGAGCAGGGCGTGCCCAA 
GACAGGCGAGGGATTCATTGACTTCATCGGGCAGGTGCATAAGACCAAGGAGCAGTTTGGACAGGATGGGCCTATCA 
CGGTGCACTGCAGTGCTGGCGTGGGCCGCACCGGGGTGTTCATCACTCTGAGCATCGTCCTGGAGCGCATGCGCTAT 
GAGGGCGTGGTCGACATGTTTCAGACCGTGAAGACCCTGCGTACACAGCGTCCTGCCATGGTGCAGACAGAGGACCA 
GTATCAGCTGTGCTACCGTGCGGCCCTGGAGTACCTCGGCAGCTTTGACCACTATGCAACG TAA CTACCGCTCCCCT 
CTCCTCCGCCACCCCCGCCGTGGGGCTCCGGAGGGGACCCAGCTCCTCTGAGCCATACCGACCATCGTCCAGCCCTC 
CTACGCAGATGCTGTCACTGGCAGAGCACAGCCCACGGGGATCACAGCGTTTCAGGAACGTTGCCACACCAATCAGA 
GAGCCTAGAACATCCCTGGGCAAGTGGATGGCCCAGCAGGCAGGCACTGTGGCCCTTCTGTCCACCAGACCCACCTG 
GAGCCCGCTTCAAGCTCTCTGTTGCGCTCCCGCATTTCTCATGCTTCTTCTCATGGGGTGGGGTTGGGGCAAAGCCT 
CCTTTTTAATACATTAAGTGGGGTAGACTGAGGGATTTTAGCCTCTTCCCTCTGATTTTTCCTTTCGCGAATCCGTA 
TCTGCAGAATGGGCCACTGTAGGGGTTGGGGTTTATTTTGTTTTGTTTTTTTTTTTTTTTTGTATGACTTCTGCTGA 
AGGACAGAACATTGCCTTCCTCGTGCAGAGCTGGGGCTGCCAGCCTGAGCGGAGGCTCGGCCGTGGGCCGGGAGGCA 
GTGCTGATCCGGCTGCTCCTCCAGCCCTTCAGACGAGATCCTGTTTCAGCTAAATGCAGGGAAACTCAATGTTTTTT 
TAAGTTTTGTTTTCCCTTTAAAGCCTTTTTTTAGGCCACATTGACAGTGGTGGGCGGGGAGAAGATAGGGAACACTC 
ATCCCTGGTCGTCTATCCCAGTGTGTGTTTAACATTCACAGCCCAGAACCACAGATGTGTCTGGGAGAGCCTGGCAA 
GGCATTCCTCATCACCATCGTGTTTGCAAAGGTTAAAACAAAAACAAAAAACCACAAAAATAAAAAACAAAAAAAAC 
AAAAAACCCAAAAAAAAAAAAAAAAAGAGTCAGCCCTTGGCTTCTGCTTCAAACCCTCAAGAGGGGAAGCAACTCCG 
TGTGCCTGGGGTTCCCGAGGGAGCTGCTGGCTGACCTGGGCCCACAGAGCCTGGCTTTGGTCCCCAGCATTGCAGTA 
TGGTGTGGTGTTTGTAGGCTGTGGGGTCTGGCTGTGTGGCCAAGGTGAATAGCACAGGTTAGGGTGTGTGCCACACC 
CCATGCACCTCAGGGCCAAGCGGGGGCGTGGCTGGCCTTTCAGGTCCAGGCCAGTGGGCCTGGTAGCACATGTCTGT 
CCTCAGAGCAGGGGCCAGATGATTTTCCTCCCTGGTTTGCAGCTGTTTTCAAAGCCCCCGATAATCGCTCTTTTCCA 
CTCCAAGATGCCCTCATAAACCAATGTGGCAAGACTACTGGACTTCTATCAATGGTACTCTAATCAGTCCTTATTAT 
CCCAGCTTGCTGAGGGGCAGGGAGAGCGCCTCTTCCTCTGGGCAGCGCTATCTAGATAGGTAAGTGGGGGCGGGGAA 
GGGTGCATAGCTGTTTTAGCTGAGGGACGTGGTGCCGACGTCCCCAAACCTAGCTAGGCTAAGTCAAGATCAACATT 
CCAGGGTTGGTAATGTTGGATGATGAAACATTCATTTTTACCTTGTGGATGCTAGTGCTGTAGAGTTCACTGTTGTA 
CACAGTCTGTTTTCTATTTGTTAAGAAAAACTACAGCATCATTGCATAATTCTTGATGGTAATAAATTTGAATAATC 
AGATTTCT 
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GGAGAGGTGCGGGCCG7UVTCCGAGCCGAGCGAGAGGAATCCGGCAGTAGAGAGCGGACTCCAGCCGGCGGACCCTGC 
AGCCCTCGCCTGGGACAGCGGCGCGCTGGGCAGGCGCCCAAGAGAGCATCGAGCAGCGGAACCCGCGAAGCCGGCCC 
GCAGCCGCGACCCGCGCAGCCTGCCGCTCTCCCGCCGCCGGTCCGGGCAGC ATG AGGCGCGCGGCGCTCTGGCTCTG 
GCTGTGCGCGCTGGCGCTGAGCCTGCAGCTGGCCCTGCCGCAAATTGTGGCTACTAATTTGCCCCCTGAAGATCAAG 
ATGGCTCTGGGGATGACTCTGACAACTTCTCCGGCTCAGGTGCAGGTGCTTTGCAAGATATCACCTTGTCACAGCAG 
ACCCCCTCCACTTGGAAGGACACGCAGCTCCTGACGGCTATTCCCACGTCTCCAGAACCCACCGGCCTGGAGGCTAC 
AGCTGCCTCCACCTCCACCCTGCCGGCTGGAGAGGGGCCCAAGGAGGGAGAGGCTGTAGTCCTGCCAGAAGTGGAGC 
CTGGCCTCACCGCCCGGGAGCAGGAGGCCACCCCCCGACCCAGGGAGACCACACAGCTCCCGACCACTCATCAGGCC 
TCAACGACCACAGCCACCACGGCCCAGGAGCCCGCCACCTCCCACCCCCACAGGGACATGCAGCCTGGCCACCATGA 
GACCTCAACCCCTGCAGGACCCAGCCAAGCTGACCTT.CACACTCCCCACACAGAGGATGGAGGTCCTTCTGCCACCG 
AGAGGGCTGCTGAGGATGGAGCCTCCAGTCAGCTCCCAGCAGCAGAGGGCTCTGGGGAGCAGGACTTCACCTTTGAA 
ACCTCGGGGGAGAATACGGCTGTAGTGGCCGTGGAGCCTGACCGCCGGAACCAGTCCCCAGTGGATCAGGGGGCCAC 
GGGGGCCTCACAGGGCCTCCTGGACAGGAAAGAGGTGCTGGGAGGGGTCATTGCCGGAGGCCTCGTGGGGCTCATCT 
TTGCTGTGTGCCTGGTGGGTTTCATGCTGTACCGCATGAAGAAGAAGGACGAAGGCAGCTACTCCTTGGAGGAGCCG 
AAACAAGCCAACGGCGGGGCCTACCAGAAGCCCACCAAACAGGAGGAATTCTATGCC TGA CGCGGGAGCCATGCGCC 
CCCTCCGCCCTGCCACTCACTAGGCCCCCACTTGCCTCTTCCTTGAAGAACTGCAGGCCCTGGCCTCCCCTGCCACC 
AGGCCACCTCCCCAGCATTCCAGCCCCTCTGGTCGCTCCTGCCCACGGAGTCGTGGGTGTGCTGGGAGCTCCACTCT 
GCTTCTCTGACTTCTGCCTGGAGACTTAGGGCACCAGGGGTTTCTCGCATAGGACCTTTCCACCACAGCCAGCACCT 
GGCATCGCACCATTCTGACTCGGTTTCTCCAAACTGAAGCAGCCTCTCCCCAGGTCCAGCTCTGGAGGGGAGGGGGA 
TCCGACTGCTTTGGACCTAAATGGCCTCATGTGGCTGGAAGATCTGCGGGTGGGGCTTGGGGCTCACACACCTGTAG 
CACTTACTGGTAGGACCAAGCATCTTGGGGGGGTGGCCGCTGAGTGGCAGGGACAGGAGTCACTTTGTTTCGTGGGG 
AGGTCTAATCTAGATATCGACTTGTTTTTGCACATGTTTCCTCTAGTTCTTTGTTCATAGCCCAGTAGACCTTGTTA 
CTTCTGAGGTAAGTTAAGTAAGTTGATTCGGTATCCCCCCATCTTGCTTCCCTAATCTATGGTCGGGAGACAGCATC 
AGGGTTAAGAAGACTTTTTTTTTTTTTTTTTAAACTAGGAGAACCAAATCTGGAAGCCAAAATGTAGGCTTAGTTTG 
TGTGTTGTCTCTTGAGTTTGTCGCTCATGTGTGCAACAGGGTATGGACTATCTGTCTGGTGGCCCCGTTTCTGGTGG 
TCTGTTGGCAGGCTGGCCAGTCCAGGCTGCCGTGGGGCCGCCGCCTCTTTCAAGCAGTCGTGCCTGTGTCCATGCGC 
TCAGGGCCATGCTGAGGCCTGGGCCGCTGCCACGTTGGAGAAGCCCGTGTGAGAAGTGAATGCTGGGACTCAGCCTT 
CAGACAGAGAGGACTGTAGGGAGGGCGGCAGGGGCCTGGAGATCCTCCTGCAGACCACXCCCGTCCTGCCTGTGCGC 
CGTCTCCAGGGGCTGCTTCCTCCTGGAAATTGACGAGGGGTGTCTTGGGCAGAGCTGGCTCTGAGCGCCTCCATCCA 
AGGCCAGGTTCTCCGTTAGCTCCTGTGGCCCCACCCTGGGCCCTGGGCTGGAATCAGGAATATTTTCCAAAGAGTGA 
TAGTCTTTTGCTTTTGGCAAAACTCTACTTAATCCAATGGGTTTTTCCCTGTACAGTAGATTTTCCAAATGTAATAA 
ACTTT7VATATAAAGT 
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GCTCCGGCCAGCCGCGGTCCAGAGCGCGCGAGGTTCGGGGAGCTCCGCCAGGCTGCTGGTACCTGCGTCCGCCCGGC 
GAGCAGGACAGGCTGCTTTGGTTTGTGACCTCCAGGCAGGACGGCCATCCTCTCCAG AATG AAGATCTTCTTGCCAG 
TGCTGCTGGCTGCCCTTCTGGGTGTGGAGCGAGCCAGCTCGCTGATGTGCTTCTCCTGCTTGAACCAGAAGAGCAAT 
CTGTACTGCCTGAAGCCGACCATCTGCTCCGACCAGGACAACTACTGCGTGACTGTGTCTGCTAGTGCCGGCATTGG 
GAATCTCGTGACATTTGGCCACAGCCTGAGCAAGACCTGTTCCCCGGCCTGCCCCATCCCAGAAGGCGTCAATGTTG 
GTGTGGCTTCCATGGGCATCAGCTGCTGCCAGAGCTTTCTGTGCAATTTCAGTGCGGCCGATGGCGGGCTGCGGGCA 
AGCGTCACCCTGCTGGGTGCCGGGCTGCTGCTGAGCCTGCTGCCGGCCCTGCTGCGGTTTGGCCCC TGA CCGCCCAG 
ACCCTGTCCCCCGATCCCCCAGCTCAGGAAGGAAAGCCCAGCCCTTTCTGGATCCCACAGTGTATGGGAGCCCCTGA 
CTCCTCACGTGCCTGATCTGTGCCCTTGGTCCCAGGTCAGGCCCACCCCCTGCACCTCCACCTGCCCCAGCCCCTGC 
CTCTGCCCAAGTGGGCCAGCTGCCCTCACTTCTGGGGTGGATGATGTGACCTTCCTTGGGGGACTGCGGAAGGGACG 
AGGGTTCCCTGGAGTCTTACGGTCCAACATCAGACCAAGTCCCATGGACATGCTGACAGGGTCCCCAGGGAGACCGT 
GTCAGTAGGGATGTGTGCCTGGCTGTGTACGTGGGTGTGCAGTGCACGTGAGAGCACGTGGCGGCTTCTGGGGGCCA 
TGTTTGGGGAGGGAGGTGTGCCAGCAGCCTGGAGAGCCTCAGTCCCTGTAGCCCCCTGCCCTGGCACAGCTGCATGC 
ACTTCAAGGGCAGCCTTTGGGGGTTGGGGTTTCTGCCACTTCCGGGTCTAGGCCCTGCCCAAATCCAGCCAGTCCTG 
CCCCAGCCCACCCCCACATTGGAGCCCTCCTGCTGCTTTGGTGCCTCAAATAAATACAGATGTCCCC 
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CAGCAGGTCACAGCCCCTCGAGGCGACAGCGGCCCCGCCGCACCAGAGCAGTGGTACAGGC ATG GATGGGAAGAAAT 
GCAGCGTATGGATGTTCCTACCTCTTGTATTTACTTTGTTTACTTCAGCTGGATTGTGGATAGTATACTTCATAGCT 
GTGGAAGATGACAAAATTTTACCATTAAATTCAGCTGAAAGGAAACCTGGTGTGAAGCATGCACCATATATAAGCAT 
TGCAGGTGATGATCCTCCTGCAAGCTGTGTGTTTAGTCAAGTTATGAACATGGCAGCCTTCCTAGCCCTTGTGGTAG 
CTGTTCTGCGCTTCATACAACTGAAACCGAAGGTTTTAAACCCGTGGCTGAATATTAGTGGATTGGTGGCTCTGTGT 
CTGGCTTCCTTCGGAATGACCTTACTTGGTAATTTTCAGCTCACAAATGATGAAGAAATCCATAACGTCGGAACTTC 
CTTGACCTTTGGATTTGGCACATTGACCTGCTGGATCCAGGCTGCGCTGACACTCAAGGTCAACATCAAGAATGAAG 
GACGGAGAGTTGGAATTCCACGGGTTATTCTGTCGGCATCTATCACTCTCTGTGTGGTCCTCTACTTCATCCTCATG 
GCCCAAAGCATCCACATGTATGCAGCCAGGGTCCAGTGGGGCCTGGTCATGTGCTTCCTGTCTTATTTTGGCACCTT 
TGCCGTGGAGTTCCGGCATTACCGCTATGAGATTGTTTGCTCTGAGTACCAGGAGAATTTCCTAAGCTTCTCAGAAA 
GCCTGTCAGAAGCTTCTGAATATCAGACTGACCAGGTGTAAACCATCAGTTTTTCCTTGCTGGTGAGGTGGGTGTGA 
CAGTGGGGGAGGGGCCAGTAGGACACACTCACAGGACTTGACATAGAACCTCATTTCACACACACACACACACACAC 
ATTCATGGCCACATTTGCCAAATGAGCTTTTCAGGGCGAGTTATTTCTTTAATGAAAAAGCACAAGCCCTTATGTGT 
CGAAATACACGCTGTTACACTGAAAATATATGCACGACAGAGCAAGAAGCTTGTGCATGATCACTTCTTATCCGTCC 
CCTTCCCAGCACTCCCTCCTCTTCCCATTCTCTCCACATGTCTCAAGCACCCTACCGAGTAGGGCAGGCCAAATGTT 
CCTTGGGAGTAATGCCAACTCCCGACGTTGCCTTCAGGTCCAAAGGGCTTGGAACCAGCTCGTGAGGAAGTTCTGAA 
TCTGGCACTAATATTCTTGAGTGGATAATAGTGTATCATAGAATAGGACGGAAATTGTATTGAGATGTGACCCTGTG 
TCGCCTGTGGAAAGGCATAGTGAGAAGAACTTTCCCACGAAAGCCCCCTTCATCGTTGTTCAGTGGTCGGCTGTGTG 
GATCCCAGGAGAGACATATGCCACAGACTGTGAGAGCAAAGCCCGCCGCTGTGATCTGGACTTGATGCACTGTGACT 
GAGAATGATTTCCAAATGTGAATATGTGTAGGGACGTGGTCTATCAGGCCTGGAACAAGATGGGGGCAGTGAAGGTA 
TGGTTTAGTGTTTGCTTTCATAGTATGCCATGTACAATGTTTTATATTTCATAGTTTCTTTTAAGTAACTACCATGA 
GTCTCTCTAAGCCTCATGGACAAAGATGTAGACCAAATGCAAGAGCTGAGCTTGCTTTGGGTTCAACCATGATCAAA 
GAAAAACTGAGGTCACCTGCAGGCTTACGTGGGAAGCTAAGACAATATC 
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CTGCCTCCACTGCTCTGTGCTGGGATC ATG GAACTTGCACTGCTGTGTGGGCTGGTGGTGATGGCTGGTGTGATTCC 
AATCCAGGGCGGGATCCTGAACCTGAACAAGATGGTCAAGCAAGTGACTGGGAAAATGCCCATCCTCTCCTACTGGC 
CCTACGGCTGTCACTGCGGACTAGGTGGCAGAGGCCAACCCAAAGATGCCACGGACTGGTGCTGCCAGACCCATGAC 
TGCTGCTATGACCACCTGAAGACCCAGGGGTGCGGCATCTACAAGGACTATTACAGATACAACTTTTCCCAGGGGAA 
CATCCACTGCTCTGACAAGGGAAGCTGGTGTGAGCAGCAGCTGTGTGCCTGTGACAAGGAGGTGGCCTTCTGCCTGA 
AGCGCAACCTGGACACCTACCAGAAGCGACTGCGTTTCTACTGGCGGCCCCACTGCCGGGGGCAGACCCCTGGGTGC 
TAGAAGCCCACACCCTCTACCCTGTTCCTCAGCATGGAGCTCTGGCATCCCCACCTCAGTATCTAACCTGAACCAGC 
CTGGCTTTTCAAACACTCCGGGGGGAGGTAGTCCCAGCCTCCCCCGGAACCCTCTACCAATGCCTTCTGACCTTCTG 
AAGCTTTCCGAATCCTCCCAGTTGAGGCAGTAGCTGTGTCCTCTGAGGGTGGATGGGAATCTTGGGAGAAGCCCAAG 
CAAGGGAGCCCTCAGAGGTGGTGTTTGGACCAAAGCATCGGGGTGGGGGAGGGGTCTGCCGCTGTCCCCCACCTGCT 
GGCCCCCTTGTCCTTCCTCACCCCCTCCAATATAGTCTCGGAGCTACAACTGCAGCAGCCACTATAAAGGGCAATAT 
TGATCTTTCTGTCCATGTGGCTCTATCTTTTAAAACCTCAAGGCCCTCCACTGTCCTAAGATAAAGCCTCTCATAGG 
CACTGGGGACCCTGCACAGTCTGGCCATGTGACCCTCTCCCCAGGCAAGCTCTGAAGTCCCTGCAGGTGGAGGCCAT 
GCCTGTCTTAAACTCAGTTGCATCCCTGGTGCCCAAAGCAACACCAGAACCAAGAAGGAGCTCCATAAATCCTTCTT 
GGGTGAAGCCTAGACAAAGCCGCCAGGTCTTGTGGCTCCAGGCACCAGAGCCTTGAGTACTTTCTCCTGCCTCCAGG 
CATTGGCTCAGGGTGAATTACAAGGGGCTACTGAATGGCTATTACTTTCATCACGACTGATCCCCACCTCCTCAGGG 
TCAAAGGGCTACTTTCTGGAAGTCTCCCCAGGCTGACTCCTTCTCCCTGACTGCAAGGGCTCACTCCCTCCTCCAAG 
CTCCCACAATGCTTCATGGCTCTGCCGCTTACCTAGCTTGGCCTAGAGTGGCAAATGGAACTTCTCTGATCTCCCCC 
AACTAGACTGGAGCCCCCGAAGGATGGAGACCATGTCTGTGCCATCTCTGTTTCCCCTGTTTTCCCACATACTAGGT 
GCTCAATTCATGCCTGTGAATGGCGTGAGCCCATAATGGATACACAGAGGTTGCAGCAGATGGTGTGGGTACCTCAC 
CCAGATATCTTCCAGGCCCAAGGCCCCTCTCCCTGAGTGAGGCCAGGTGTTGGCAGCCAACTGCTCCAATCTGCCTC 
CTTCCCCTAAATACTGCCCTGGTCTAGTGGGAGCTGCCTTCCCCCTGCCCCACCTCTCCCACCAAGAGGCCACCTGT 
CACTCATGGCCAGGAGAGTGACACCATGGAGGGTACAATTGCCAGCTCCCCCGTGTCTGTGCAGCATTGTCTGGGTT 
GAATGACACTCTCAAATTGTTCCTGGGATCGGGCTGAGGCCAGGCCTCTCCTGGAACCACCTCTCTGCTTGGTCTGA 
CCCCTTGGCCTATCCAGTTTTCCTGGTTCCCTCACAGGTTTCTCCAGAAAGTACTCCCTCAGTAAAGCATTTGCACA 
AGAAAAAAAAAAAA 



21/47 

FIGURE 18A 



GGCTGAAAGAGCCTGAGCTGTGCCTCTCCATTCCACTGCTGTGGCAGGGTCAGAAATCTTGGATAGAGAAAACCTTT 
TGCAAACGGGAATGTATCTTTGTAATTCCTAGCACGAAAGACTCTAACAGGTGTTGCTGTGGCCAGTTCACCAACCA 
GCATATCCCCCCTCTGCCAAGTGCAACACCCAGCTVAAAATGAAGAGGAAAGCAAACAGGTGGAGACTCAGCCTGAGA 
AATGGTCTGTTGCCAAGCACACCCAGAGCTACCCAACAGATTCCTATGGAGTTCTTGAATTCCAGGGTGGCGGATAT 
TCCAATAAAGCCATGTATATCCGTGTATCCTATGACACCAAGCCAGACTCACTGCTCCATCTCATGGTGAAAGATTG 
GCAGCTGGAACTCCCCAAGCTCTT7VATATCTGTGCATGGAGGCCTCCAGAACTTTGAGATGCAGCCCAAGCTGAAAC 
AAGTCTTTGGGAAAGGCCTGATCAAGGCTGCTATGACCACCGGGGCCTGGATCTTCACCGGGGGTGTCAGCACAGGT 
GTTATCAGCCACGTAGGGGATGCCTTG7Uy\GACCACTCCTCCAAGTCCAGAGGCCGGGTTTGTGCTATAGGAATTGC 
TCCATGGGGCATCGTGGAGAATAAGGAAGACCTGGTTGGAAAGGATGTAACAAGAGTGTACCAGACCATGTCCAACC 
CTCTAAGTAAGCTCTCTGTGCTCAACAACTCCCACACCCACTTCATCCTGGCTGACAATGGCACCCTGGGCAAGTAT 
GGCGCCGAGGTGAAGCTGCGAAGGCTGCTGGAAAAGCACATCTCCCTCCAGAAGATCAACACAAGACTGGGGCAGGG 
CGTGCCCCTCGTGGGTCTCGTGGTGGAGGGGGGCCCTAACGTGGTGTCCATCGTCTTGGAATACCTGCAAGAAGAGC 
CTCCCATCCCTGTGGTGATTTGTGATGGCAGCGGACGTGCCTCGGACATCCTGTCCTTTGCGCACAAGTACTGTGAA 
GAAGGCGGAATAATAAATGAGTCCCTCAGGGAGCAGCTTCTAGTTACCATTCAGAAAACATTTAATTATAATAAGGC 
ACAATCACATCAGCTGTTTGCAATTATAATGGAGTGCATGAAGAAGAAAGAACTCGTCACTGTGTTCAGAATGGGTT 
CTGAGGGCCAGCAGGACATCGAGATGGCAATTTTAACTGCCCTGCTGAAAGGAACAAACGTATCTGCTCCAGATCAG 
CTGAGCTTGGCACTGGCTTGGAACCGCGTGGACATAGCACGAAGCCAGATCTTTGTCTTTGGGCCCCACTGGACGCC 
CCTGGGAAGCCTGGCACCCCCGACGGACAGCAAAGCCACGGAGAAGGAGAAGAAGCCACCCATGGCCACCACCAAGG 
GAGGAAGAGGAAAAGGGAAAGGCAAGAAGAAAGGGAAAGTGAAAGAGGAAGTGGAGGAAGAAACTGACCCCCGGAAG 
ATAGAGCTGCTGAACTGGGTGAATGCTTTGGAGCAAGCGATGCTAGATGCTTTAGTCTTAGATCGTGTCGACTTTGT 
GAAGCTCCTGATTGAAAACGGAGTGAACATGCAACACTTTCTGACCATTCCGAGGCTGGAGGAGCTCTATAACACAA 
GACTGGGTCCACCAAACACACTTCATCTGCTGGTGAGGGATGTGAAAAAGAGCAACCTTCCGCCTGATTACCACATC 
AGCCTCATAGACATCGGGCTCGTGCTGGAGTACCTCATGGGAGGAGCCTACCGCTGCAACTACACTCGGAAAAACTT 
TCGGACCCTTTACAACAACTTGTTTGGACCAAAGAGGCCTAAAGCTCTTAAACTTCTGGGAATGGAAGATGATGAGC 
CTCCAGCTAAAGGGAAGAAAAAAAAAAAAAAGAAAAAGGAGGAAGAGATCGACATTGATGTGGACGACCCTGCCGTG 
AGTCGGTTCCAGTATCCCTTCCACGAGCTGATGGTGTGGGCAGTGCTGATGAAACGCCAGAAAATGGCAGTGTTCCT 
CTGGCAGCGAGGGGAAGAGAGCATGGCCAAGGCCCTGGTGGCCTGCAAGCTCTACAAGGCCATGGCCCACGAGTCCT 
CCGAGAGTGATCTGGTGGATGACATCTCCCAGGACTTGGATAACAATTCCAAAGACTTCGGCCAGCTTGCTTTGGAG 
TTATTAGACCAGTCCTATAAGCATGACGAGCAGATCGCTATGAAACTCCTGACCTACGAGCTGAAAAACTGGAGCAA 
CTCGACCTGCCTCAAACTGGCCGTGGCAGCCAAACACCGGGACTTCATTGCTCACACCTGCAGCCAGATGCTGCTGA 
CCGATATGTGGATGGGAAGACTGCGGATGCGGAAGAACCCCGGCCTGAAGGTTATCATGGGGATTCTTCTACCCCCC 
ACCATCTTGTTTTTGGAATTTCGCACATATGATGATTTCTCGTATCAAACATCCAAGGAAAACGAGGATGGCAAAGA 
AAAAGAAGAGGAAAATACGGATGCAAATGCAGATGCTGGCTCAAGAAAGGGGGATGAGGAGAACGAGCATAAAAAAC 
AGAGAAGTATTCCCATCGGAACAAAGATCTGTGAATTCTATAACGCGCCCATTGTCAAGTTCTGGTTTTACACAATA 
TCATACTTGGGCTACCTGCTGCTGTTTAACTACGTCATCCTGGTGCGGATGGATGGCTGGCCGTCCCTCCAGGAGTG 
GATCGTCATCTCCTACATCGTGAGCCTGGCGTTAGAGAAGATACGAGAGATCCTCATGTCAGAACCAGGCAAACTCA 
GCCAGAAAATCAAAGTTTGGCTTCAGGAGTACTGGAACATCACAGATCTCGTGGCCATTTCCACATTCATGATTGGA 
GCAATTCTTCGCCTACAGAACCAGCCCTACATGGGCTATGGCCGGGTGATCTACTGTGTGGATATCATCTTCTGGTA 
CATCCGTGTCCTGGACATCTTTGGTGTCAACAAGTATCTGGGGCCATACGTGATGATGATTGGAAAGATGATGATCG 
ACATGCTGTACTTTGTGGTCATCATGCTGGTCGTGCTCATGAGTTTCGGAGTAGCCCGTCAAGCCATTCTGCATCCA 
GAGGAGAAGCCCTCTTGGAAACTGGCCCGAAACATCTTCTACATGCCCTACTGGATGATCTATGGAGAGGTGTTTGC 
AGACCAGATAGACCTCTACGCCAT'GGAAATTAATCCTCCTTGTGGTGAGAACCTATATGATGAGGAGGGCAAGCGGC 
TTCCTCCCTGTATCCCCGGCGCCTGGCTCACTCCAGCACTCATGGCGTGCTATCTACTGGTCGCCAACATCCTGCTG. 
GTGAACCTGCTGATTGCTGTGTTCAACAATACTTTCTTTGAAGTAAAATCAATATCCAACCAGGTGTGGAAGTTCCA 
GCGATATCAGCTGATTATGACATTTCATGACAGGCCAGTCCTGCCCCCACCGATGATCATTTTAAGCCACATCTACA 
TCATCATTATGCGTCTCAGCGGCCGCTGCAGGAAAAAGAGAGAAGGGGACCAAGAGGAACGGGATCGTGGATTGAAG 
CTCTTCCTTAGCGACGAGGAGCTAAAGAGGCTGCATGAGTTCGAGGAGCAGTGCGTGCAGGAGCACTTCCGGGAGAA 
GGAGGATGAGCAGCAGTCGTCCAGCGACGAGCGCATCCGGGTCACTTCTGAAAGAGTTGAAAATATGTCAATGAGGT 
TGGAAGAAATCAATGAAAGAGAAACTTTTATGAAAACTTCCCTGCAGACTGTTGACCTTCGACTTGCTCAGCTAGAA 
GAATTATCTAACAGAATGGTGAATGCTCTTGAAAATCTTGCGGGAATCGACAGGTCTGACCTGATCCAGGCACGGTC 
CCGGGCTTCTTCTGAATGTGAGGCAACGTATCTTCTCCGGCAAAGCAGCATCAATAGCGCTGATGGCTACAGCTTGT 
ATCGATATCATTTTAACGGAGAAGAGTTATTATTTGAGGATACATCTCTCTCCACGTCACCAGGGACAG 
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GAGTCAGGAAAAAAACCTGTTCCTTCCGTATAAAGGAAGAGAAGGACGTGAAAACGCACCTAGTCCCAGAATGTCAG 
AACAGTCTTCACCTTTCACTGGGCACAAGCACATCAGCAACCCCAGATGGCAGTCACCTTGCAGTAGATGACTTAAA 
GAACGCTGAAGAGTCAAAATTAGGTCCAGATATTGGGATTTCAAAGGAAGATGATGAAAGACAGACAGACTCTAAAA 
AAGAAGAAACTATTTCCCCAAGTTTAAATAAAACAGATGTGATACATGGACAGGACAAATCAGATGTTCAAAACACT 
CAGCTAACAGTGGAAACGACAAATATAGAAGGCACTATTTCCTATCCCCTGGAAGAAACCAAAATTACACGCTATTT 
CCCCGATGAAACGATCAATGCTTGTAAAACAATGAAGTCCAGAAGCTTCGTCTATTCCCGGGGAAGAAAGCTGGTCG 
GTGGGGTT7VACCAGGATGTAGAGTACAGTTCAATCACGGACCAGCAATTGACGACGGAATGGCAATGCCAAGTTCAA 
AAGATCACGCGCTCTCATAGCACAGATATTCCTTACATTGTGTCGGAAGCTGCAGTGCAAGCTGAGCAAAAAGAGCA 
GTTTGCAGATATGCAAGATGAACACCATGTCGCTGAAGCAATTCCTCGAATCCCTCGCTTGTCCCTAACCATTACTG 
ACAGAAATGGGATGGAAAACTTACTGTCTGTGAAGCCAGATCAAACTTTGGGATTCCCATCTCTCAGGTCAAAAAGT 
TTACATGGACATCCTAGGAATGTGAAATCCATTCAGGGAAAGTTAGACAGATCTGGACATGCCAGTAGTGTAAGCAG 
CTTAGTAATTGTGTCTGGAATGACAGCAGAAGAAAAAAAGGTTAAGAAAGAGAAAGCTTCCACAGAAACTGAATGCT 
AGTCTGTTTTGTTTCTTTAATTTTTTTTTTTAACAGTCAGAAACCCACTAATGGGTGTCATCTTGGCCCATCCTAAA 
CACATMTCCAATTTCCTAAAAACATTTTCCCTTAAAAAATTTTGGAAATTCAGACTTGATTTACAATTTAATGCACT 
AAAAGTAGTATTTTGTTAGXATATGTTAGTAGGCTTAGTTTTTTCAGTTGCAGTAGTATCAAATGAAAGTGATGATA 
CTGTAACGAAGATAAATTGGCTAATCAGTATACAAGATTATACAATCTCTTTATTACTGAGGGCCACCAAATAGCCT 
AGGAAGTGCCCTCGAGCACTGAAGTCACCATTAGGTCACTCAAGAAGTAAGCAACTAGCTGGGCACAGTGGCTCATG 
CCTGTAATCCTAGCACTTTGGGAGGCCAAGGCAGAAAGATAGCTTGAGTCCAGGAGTTTGAGACCAGCCTGGGCAAC 
ATAGTGATACCCCATCTCTTAAAAAAAAAAAAAAAAAA 



t 
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FIGURE 19 



CTGAATCTTCGTTTCTCTCCCAGGGACCCTCCATTTTCCATATCCAGGATU^TGTGATGCGCCACAGGTATCAGCGT 
CTGGATCGCCACTTCACGTTTTAGCCACAAGTGACTCAGTGGAAGATCCAGAGTCAACAGAGGCTCGTCAGGAAGAT 
GTCTACAGAAAAGGTAGACCTiJUU^GGAGGAAGCTGGGGAAAAAGAGGTGTGCGGAGACCAGATCAAAGGACCGGACA 
AAGAGGAGGAACCACCAGCTGCTGCATCCCATGGCCAGGGGTGGCGTCCAGGTGGCAGAGCAGCTAGGAACGC/U^GG 
CCTGTIACCTGGGGCCAGACACCCTGCTCTCCCGGCCATGGTCAACGACCCTCCAGTACCTGCCTTACTGTGGGCCCA 
GGAGGTGGGCCAAGTCTTGGCAGGCCGTGCCCGCAGGCTGCTGCTGCAGTTTGGGGTGCTCTTCTGCACCATCCTCC 
TTTTGCTCTGGGTGTCTGTCTTCCTCTATGGCTCCTTCTACTATTCCTATATGCCGACAGTCAGCCACCTCAGCCCT 
GTGCATTTCTACTACAGGACCGACTGTGATTCCTCCACCACCTCACTCTGCTCCTTCCCTGTTGCCAATGTCTCGCT 
GACTAAGGGTGGACGTGATCGGGTGCTGATGTATGGACAGCCGTATCGTGTTACCTTAGAGCTTGAGCTGCCAGAGT 
CCCCTGTGAATCAAGATTTGGGCATGTTCTTGGTCACCATTTCCTGCTACACCAGAGGTGGCCGAATCATCTCCACT 
TCTTCGCGTTCGGTGATGCTGCATTACCGCTCAGACCTGCTCCAGATGCTGGACACACTGGTCTTCTCTAGCCTCCT 
GCTATTTGGCTTTGCAGAGCAGAAGCAGCTGCTGGAGGTGGAACTCTACGCAGACTATAGAGAGAACTCGTACGTGC 
CGACCACTGGAGCGATCATTGAGATCCACAGCAAGCGCATCCAGCTGTATGGAGCCTACCTCCGCATCCACGCGCAC 
TTCACTGGGCTCAGATACCTGCTATACAACTTCCCGATGACCTGCGCCTTCATAGGTGTTGCCAGCAACTTCACCTT 
CCTCAGCGTCATCGTGCTCTTCAGCTACATGCAGTGGGTGTGGGGGGGCATCTGGCCCCGACACCGCTTCTCTTTGC 
AGGTTAACATCCGAAAAAGAGACAATTCCCGGAAGGAAGTCCAACGAAGGATCTCTGCTCATCAGCCAGGGCCTGAA 
GGCCAGGAGGAGTCAACTCCGCAATCAGATGTTACAGAGGATGGTGAGAGCCCTGAAGATCCCTCAGGGACAGAGGG 
TCAGCTGTCCGAGGAGGAGAAACCAGATCAGCAGCCCCTGAGCGGAGAAGAGGAGCTAGAGCCTGAGGCCAGTGATG 
GTTCAGGCTCCTGGGAAGATGCAGCTTTGCTGACGGAGGCCAACCTGCCTGCTCCTGCTCCTGCTTCTGCTTCTGCC 
CCTGTCCTAGAGACTCTGGGCAGCTCTGAACCTGCTGGGGGTGCTCTCCGACAGCGCCCCACCTGCTCTAGTTCCTG 
AAGAAAAGGGGCAGACTCCTCACATTCCAGCACTTTCCCACCTGACTCCTCTCCCCTCGTTTTTCCTTCAATAAACT 
ATTTTGTGTCAGCTTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 2 OA 



AACCGGCTGCGG6G ATG GGGCCACCGCTCCCGCTGCTGCTGCTGCTACTGCTGCTGCTGCCGCCACGCGTCCTGCCT 
GCCGCCCCTTCGTCCGTCCCCCGCGGCCGGCAGCTCCCGGGGCGTCTGGGCTGCCTGCTCGAGGAGGGCCTCTGCGG 
AGCGTCCGAGGCCTGTGTGAACGATGGAGTGTTTGGAAGGTGCCAGAAGGTTCCGGCAATGGACTTTTACCGCTACG 
AGGTGTCGCCCGTGGCCCTGCAGCGCCTGCGCGTGGCGTTGCAGAAGCTTTCCGGCACAGGTTTCACGTGGCAGGAT 
GACTATACTCAGTATGTGATGGACCAGGAACTTGCAGACCTCCCGAAAACCTACCTGAGGCGTCCTGAAGCATCCAG 
CCCAGCCAGGCCCTCAAAACACAGCGTTGGCAGCGAGAGGAGGTACAGTCGGGAGGGCGGTGCTGCCCTGGCCAACG 
CCCTCCGACGCCACCTGCCCTTCCTGGAGGCCCTGTCCCAGGCCCCAGCCTCAGACGTGCTCGCCAGGACCCATACG 
GCGCAGGACAGACCCCCCGCTGAGGGTGATGACCGCTTCTCCGAGAGCATCCTGACCTATGTGGCCCACACGTCTGC 
GCTGACGTACCCTCCCGGGCCCCGGACCCAGCTCCGGGAGGACCTCCTGCCGCGGACCCTCGGCCAGCTCCAGCCAG 
ATGAGCTCAGCCCTAAGGTGGACAGTGGTGTGGACAGACACCATCTGATGGCGGCCCTCAGTGCCTATGCTGCCCAG 
AGGCCCCCAGCTCCCCCCGGGGAGGGCAGCCTGGAGCCACAGTACCTTCTGCGTGCACCCTCAAGAATGCCCAGGCC 
TTTGCTGGCACCAGCCGCCCCCCAGAAGTGGCCTTCACCTCTGGGAGATTCCGAAGACCCCTCCAGCACAGGCGATG 
GAGCACGGATTCATACCCTCCTGAAGGACCTGCAGAGGCAGCCGGCTGAGGTGAGGGGCCTGAGTGGCCTGGAGCTG 
GACGGCATGGCTGAGCTGATGGCTGGCCTGATGCAAGGCGTGGACCATGGAGTAGCTCGAGGCAGCCCTGGGAGAGC 
GGCCCTGGGAGAGTCTGGAGAACAGGCGGATGGCCCCAAGGCCACCCTCCGTGGAGACAGCTTTCCAGATGACGGAG 
TGCAGGACGACGATGATAGACTTTACCAAGAGGTCCACCGTCTGAGTGCCACACTCGGGGGCCTCCTGCAGGACCAC 
GGGTCTCGACTCTTACCTGGAGCCCTCCCCTTTGCAAGGCCCCTCGACATGGAGAGGAAGAAGTCCGAGCACCCTGA 
GTCTTCCCTGTCTTCAGAAGAGGAGACTGCCGGAGTGGAGAACGTCAAGAGCCAGACGTATTCCAAAGATCTGCTGG 
GGCAGCAGCCGCATTCGGAGCCCGGGGCCGCTGCGTTTGGGGAGCTCCAAAACCAGATGCCTGGGCCCTCGAAGGAG 
GAGCAGAGCCTTCCAGCGGGTGCTCAGGAGGCCCTCAGCGACGGCCTGCAATTGGAGGTCCAGCCTTCCGAGGAAGA 
GGCGCGGGGCTACATCGTGACAGACAGAGACCCCCTGCGCCCCGAGGAAGGAAGGCGGCTGGTGGAGGACGTCGCCC 
GCCTCCTGCAGGTGCCCAGCAGTGCGTTCGCTGACGTGGAGGTTCTCGGACCAGCAGTGACCTTCAAAGTGAGCGCC 
AATGTCCAAAACGTGACCACTGAGGATGTGGAGAAGGCCACAGTTGACAACAAAGACAAACTGGAGGAAACCTCTGG 
ACTGAAAATTCTTCAAACCGGAGTCGGGTCGAAAAGCAAACTCAAGTTCCTGCCTCCTCAGGCGGAGCAAGAAGACT 
CCACCAAGTTCATCGCGCTCACCCTGGTCTCCCTCGCCTGCATCCTGGGCGTCCTCCTGGCCTCTGGCCTCATCTAC 
TGCCTCCGCCATAGCTCTCAGCACAGGCTGAAGGAGAAGCTCTCGGGACTAGGGGGCGACCCAGGTGCAGATGCCAC 
TGCCGCCTACCAGGAGCTGTGCCGCCAGCGTATGGCCACGCGGCCACCAGACCGACCTGAGGGCCCGCACACGTCAC 
GCATCAGCAGCGTCTCATCCCAGTTCAGCGACGGGCCGATCCCCAGCCCCTCCGCACGCAGCAGCGCCTCATCCTGG 
TCCGAGGAGCCTGTGCAGTCCAACATGGACATCTCCACCGGCCACATGATCCTGTCCTACATGGAGGACCACCTGAA 
GAACAAGAACCGGCTGGAGAAGGAGTGGGAAGCGCTGTGCGCCTACCAGGCGGAGCCCAACAGCTCGTTCGTGGCCC 
AGAGGGAGGAGAACGTGCCCAAGAACCGCTCCCTGGCTGTGCTGACCTATGACCACTCCCGGGTCCTGCTGAAGGCG 
GAGAACAGCCACAGCCACTCAGACTACATCAACGCTAGCCCCATCATGGATCACGACCCGAGGAACCCCGCGTACAT 
CGCCACCCAGGGACCGCTGCCCGCCACCGTGGCTGACTTTTGGCAGATGGTGTGGGAGAGCGGCTGCGTGGTGATCG 
TCATGCTGACACCCCTCGCGGAGAACGGCGTCCGGCAGTGCTACCACTACTGGCCGGATGAAGGCTCCAATCTCTAC 
CACATCTATGAGGTGAACCTGGTCTCCGAGCACATCTGGTGTGAGGACTTCCTGGTGAGGAGCTTCTATCTGAAGAA 
CCTGCAGACCAACGAGACGCGCACCGTGACGCAGTTCCACTTCCTGAGTTGGTATGACCGAGGAGTCCCTTCCTCCT 
CAAGGTCCCTCCTGGACTTCCGCAGAAAAGTAAACAAATGCTACAGGGGCCGTTCTTGTCCGATAATTGTTCATTGC 
AGTGACGGTGCAGGCCGGAGCGGCACCTACGTCCTGATCGACATGGTTCTCAACAAGATGGCCAAAGGTGCTAAAGA 
GATTGATATCGCAGCGACCCTGGAGCACTTGAGGGACCAGAGACCCGGCATGGTCCAGACGAAGGAGCAGTTTGAGT 
TCGCGCTGACAGCCGTGGCTGAGGAGGTGAACGCCATCCTCAAGGCCCTTCCCCAGTGAGCGGCAGCGTCAGGGGCC 
TCAGGGGAGCCCCCACCCCACGGATGTTGTCAGGAATCATGATCTGACTTTAATTGTGTGTCTTCTATTATAACTGC 
ATAGTAATAGGGCCCTTAGCTCTCCCGTAGTCAGCGCAGTTTAGCAGTTAAAAGTGTATTTTTGTTTAATCAAACAA 
TAATAAAGAGAGATTTGTGGAAAAATCCAGTTACGGGTGGAGGGGAATCGGTTCATCAATTTTCACTTGCTTAAAAA 
AAATACTTTTTCTTAAAGCACCCGTTCACCTTCTTGGTTGAAGTTGTGTTAACAATGCAGTAGCCAGCACGTTCGAG 
GCGGTTTCCAGGAAGAGTGTGCTTGTCATCTGCCACTTTCGGGAGGGTGGATCCACTGTGCAGGAGTGGCCGGGGAA 
GCTGGCAGCACTCAGTGAGGCCGCCCGGCACACAAGGCACGTTTGGCATTTCTCTTTGAGAGAGTTTATCATTGGGA 
GAAGCCGCGGGGACAGAACTGAACGTCCTGCAGCTTCGGGGCAAGTGAGACAATCACAGCTCCTCGCTGCGTCTCCA 
TCAACACTGCGCCGGGTACCATGGACGGCCCCGTCAGCCACACCGGTCAGCCCAAGCAGAGTGATTCAGGGGCTCCC 
CGGGGGCAGACACCTGTGCACCCCATGAGTAGTGCCCACTTGAGGCTGGCACTCCCCTGACCTCACCTTTGCAAAGT 
TACAGATGCACCCCAACATTGAGATGTGTTTTTAATGTTAAAATATTGATTTCTACGTTATGAAAACAGATGCCCCC 
GTGAATGCTTACCTGTGAGATAACCACAACCAGGAAGAACAAATCTGGGCATTGAGCAAGCTATGAGGGTCCCCGGG 
AGCACACG7KACCCTGCCAGGCCCCCGCTGGCTCCTCCAGGCACGTCCCGGACCTGTGGGGCCCCAGAGA 
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FIGURE 2 OB 



GGGGACATTTCCCTCCTGGGAGAG7VAGGAGATCAGGGCAACTCGGAGAGGGCTGCGAGCATTTCCCTCCCGGGAGAG 
GAAATCAGGGCGACCTGCACGCACTGCGTAGAGCCTGGAAGGGAAGTGAGAAACCAGCCGACCGGCCCTGCCCCTCT 
TCCCGGGATCACTTAATGAACCACGTGTTTTGACATCATGTTAACCTAAGCACGTACAGATGATTCCGGATTTGACA 
AAATAACATTTGAGTATCCGATTCGCCATCACCCCTACCCCCGAAATAGGACAACTCACTTCATTGACCAGGATGAT 
CACATGGAAGGCGGCGCAGAGGCAGCTGTGTGGGCTGCAGATTTCCTGTGTGGGGTTCAGCGTATAAAACGCACCTC 
CATCCCGCCCTTCCCACAGCATTCCTCCATCTTAGATAGATGGTACTCTCCAAAGGCCCTACCAGAGGGAACACGGC 
CTACTGAGCGGACAGAATGATGCCAAAATATTGCTTATGTCTCTACATGGTATTGTAATGAATATCTGCTTTAATAT 
AGCTATCATTTCTTTTCCAAAATTACTTCTCTTTATCTGGAATTTAATTAATCGAAATGAATTTATCTGAATATAGG 
AAGCATATGCCTACTTGTAATTTCTAACTACTTATGTTTGAAGAGAAACCTCCGGTGTGAGATATACAAATATATTT 
AATTGTGTCATATTAAACTTCCCGGAATTC 
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FIGURE 21 



GCATCTGGTTTGTCAGATCCGAGAGGCTCTGAAACTGCGGAGCGGCCACCGGACGCCTTCTGGAGCAGGTAGCAGCA 
TGCAGCCGCCTCCAAGTCTGTGCGGACGCGCCCTGGTTGCGCTGGTTCTTGCCTGCGGCCTGTCGCGGATCTGGGGA 
GAGGAGAGAGGCTTCCCGCCTGACAGGGCCACTCCGCTTTTGCAAACCGCAGAGATAATGACGCCACCCACTAAGAC 
CTTATGGCCCAAGGGTTCCAACGCCAGTCTGGCGCGGTCGTTGGCACCTGCGGAGGTGCCTAAAGGAGACAGGACGG 
CAGGATCTCCGCCACGCACCATCTCCCCTCCCCCGTGCCAAGGACCCATCGAGATCAAGGAGACTTTCAAATACATC 
AACACGGTTGTGTCCTGCCTTGTGTTCGTGCTGGGGATCATCGGGAACTCCACACTTCTGAGAATTATCTACAAGAA 
CAAGTGCATGCGAAACGGTCCCAATATCTTGATCGCCAGCTTGGCTCTGGGAGACCTGCTGCACATCGTCATTGACA 
TCCCTATCAATGTCTACAAGCTGCTGGCAGAGGACTGGCCATTTGGAGCTGAGATGTGTAAGCTGGTGCCTTTCATA 
CAGAAAGCCTCCGTGGGAATCACTGTGCTGAGTCTATGTGCTCTGAGTATTGACAGATATCGAGCTGTTGCTTCTTG 
GAGTAGAATTAAAGGAATTGGGGTTCCAAAATGGACAGCAGTAGAAATTGTTTTGATTTGGGTGGTCTCTGTGGTTC 
TGGCTGTCCCTGAAGCCATAGGTTTTGATATAATTACGATGGACTACAAAGGAAGTTATCTGCGAATCTGCTTGCTT 
CATCCCGTTCAGAAGACAGCTTTCATGCAGTTTTACAAGACAGCAAAAGATTGGTGGCTGTTCAGTTTCTATTTCTG 
CTTGCCATTGGCCATCACTGCATTTTTTTATACACTAATGACCTGTGAAATGTTGAGAAAGAAAAGTGGCATGCAGA 
TTGCTTTAAATGATCACCTAAAGCAGAGACGGGAAGTGGCCAAAACCGTCTTTTGCCTGGTCCTTGTCTTTGCCCTC 
TGCTGGCTTCCCCTTCACCTCAGCAGGATTCTGAAGCTCACTCTTTATAATCAGAATGATCCCAATAGATGTGAACT 
TTTGAGCTTTCTGTTGGTATTGGACTATATTGGTATCAACATGGCTTCACTGAATTCCTGCATTAACCCAATTGCTC 
TGTATTTGGTGAGCAAAAGATTCAAAAACTGCTTTAAGTCATGCTTATGCTGCTGGTGCCAGTCATTTGAAGAAAAA 
CAGTCCTTGGAGGAAAAGCAGTCGTGCTTAAAGTTCAAAGCTAATGATCACGGATATGACAACTTCCGTTCCAGTAA 
TAAATACAGCTCATCTTGAAAGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



27/47 



FIGURE 22 

MKENYCLQAALVCLGMLCHSHAFAPERRGHLRPSFHGHHEKGKEGQVLQRSKRGWVWNQFFVIEEYTGPDPVLVGRL 

HSDIDSGDGNIKYILSGEGAGTIFVIDDKSGNIHATKTLDREERAQYTLMAQAVDRDTNRPLEPPSEFIVKVQDIND 
NPPEFLHETYHANVPERSNVGTSVIQVTASDADDPTYGNSAKLVYSILEGQPYFSVEAQTGIIRTALPNMDREAKEE 
YHVVIQAKDMGGHMGGLSGTTKVTITLTDVNDNPPKFPQRLYQMSVSEAAVPGEEVGRVKAKDPDIGENGLVTYNIV 
DGDGMESFEITTDYETQEGVIKLKKPVDFETERAYSLKVEAANVHIDPKFISNGPFKDTVTVKISVEDADEPPMFLA 
PSYIHEVQENAAAGTVVGRVHAKDPDAANSPIRYSIDRHTDLDRFFTINPEDGFIKTTKPLDREETAWLNITVFAAE 
IHNRHQEAQVPVAIRVLDVNDNAPKFAAPYEGFICESDQTKPLSNQPIVTISADDKDDTANGPRFIFSLPPEIIHNP 
NFTVRDNRDNTAGVYARRGGFSRQKQDLYLLPIVISDGGIPPMSSTNTLTIKVCGCDVNGALLSCNAEAYILNAGLS 
TGALIAILACIVILLVIVVLFVTLRRQKKEPLIVFEEEDVRENIITYDDEGGGEEDTEAFDIATLQNPDGINGFIPR 
KDIKPEYQYMPRPGLRPAPNSVDVDDFINTRIQEADNDPTAPPYDSIQIYGYEGRGSVAGSLSSLESATTDSDLDYD 
YLQNWGPRFKKLADLYGSKDTFDDDS 

Signal sequence, 
amino acids 1-22 

Transmembrane domain, 
amino acids 617-637 

N-glycosylation sites, 
amino acids 455-458, 540-543 

Glycosaminoglycan attachment sites, 
amino acids 83-86, 93-96 

N-myristoylation sites. 

amino acids 108-113, 215-220, 242-247, 246-251, 247-252, 399-404, 594-599, 599- 
604, 614-619, 618-623, 749-754, 753-758, 787-792 

Cadherins extracellular repeated domain signatures, 
amino acids 147-157, 256-266, 476-486 

Cadherin cytoplasmic region, 
amino acids 641-789 

Cadherin domains. 

amino acids 59-150, 164-259, 273-375, 388-479, 492-593 
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FIGURE 23 

MYGKSSTRAVLLLLGIQLTALWPIAAVEIYTSRVLEAVNGTDARLKCTFSSFAPVGDALTVTWNFRPLDGGPEQFVF 

YYHIDPFQPMSGRFKDRVSWDGNPERYDASILLWKLQFDDNGTYTCQVKNPPDVDGVIGEIRLSWHTVRFSEIHFL 
ALAIGSACALMIIIVIVWLFQHYRKKRWAERAHKVVEIKSKEEERLNQEKKVSVYLEDTD 

Signal sequence, 
amino acids 1-21 

N-glycosylation sites, 
amino acids 39-42^ 118-121 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 205-208 

N-myristoylation sites. 

amino acids 15-20, 119-124, 159-164 

Immunoglobulin domain, 
amino acids 40-125 
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FIGURE 24 

MAEQQGRELEAECPVCWNPFNNTFHTPKMLDCCHSFCVECLAHLSLVTPARRRLLCPLCRQPTVLASGQPVTDLPTD 
TAMLTLLRLEPHHVILEGHQLCLKDQPKSRYFLRQPRVYTLDLGPQPGGQTGPPPDTASATVSTPILIPSHHSLREC 
FRNPQFRIFAYLMAVILSVTLLLIFSIFWTKQFLWGVG 

Transmembrane domain, 
amino acids 162-182 

N-glycosylation site, 
amino acids 21-24 

N-myristoylation site, 
amino acids 68-73 

Zinc finger, C3HC4 type, signature, 
amino acids 32-41 



30/47 



FIGURE 25 

MATAAGATYFQRGSLFWFTVITLSFGYYTWVVFWPQSIPYQNLGPLGPFTQYLVDHHHTLLCNGYWLAWLIHVGESL 
YAIALCKHKGITSGRAQLLWFLQTFFFGIASLTILIAYKRKRQKQT 

Transmembrane domain, 
amino acids 57-77, 92-112 

N-myristoylation site, 
amino acids 87-92 

Leucine zipper pattern, 
amino acids 4 6-67 
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FIGURE 26 

MASKIGSRRWMLQLIMQLGSVLLTRCPFWGCFSQLMLYAERAEARRKPDIPVPYLYFDMGAAVLCASFMSFGVKRRW 

FALGAALQLAISTYAAYIGGYVHYGDWLKVRMYSRTVAIIGGFLVLASGAGELYRRKPRSRSLQSTGQVFLGIYLIC 
VAYSLQHSKEDRLAYLNHLPGGELMIQLFFVLYGILALAFLSGYYVTLAAQILAVLLPPVMLLIDGNVAYWHNTRRV 
EFWNQMKLLGESVGIFGTAVILATDG 

Transmembrane domains. 

amino acids 6-26, 51-71, 106-126, 138-158, 174-194, 201-221, 237-256 

Glycosaminoglycan attachment site, 
amino acids 125-128 

N-myristoylation site, 
amino acids 245-250 
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. FIGURE 27 

GRGSPLALLIRMKTLLFGVWALLALILCPGVPEELFEVSIWPSQALVEFGQSLVCNCSTTCPDPGPSGIETFLKKTQ 
VDKGPQWKEFLLEDVTENSILQCFFSCAGIQKDTSLGITVYQPPEQVILELQPAWVAVDEAFTVKCHVPSVAPLESL 
TLALLQGNQELHRKNFTSLAVASQRAEVIISVRAQKENDRCNSSCHAELDLSLQGGRLFQGSSPIRIVRIFEFSQSP 
HIWVSSLLEAGMAETVSCEVARVFPAKEVMFHMFLEDQELSSSLSWEGDTAWANATIRTMEAGDQELSCFASLGAME 
QKTRKLVHSYSFPPPILELKESYPLAGTDINVTCSGHVLTSPSPTLRLQGAPDLPAGEPAWLLLTAREEDDGXNFSC 
EASLWQGQRLMKTTVIQLHILKPQLEESSCPGKQTWLEGMEHTLACVPKGNPAPALVCTWNGWFDLEVPQBCAT 

Signal sequence, 
amino acids 1-30 

N-glycosylation sites. 

amino acids 56-59, 169-172, 196-199, 285-288, 339-342, 382-385 

N-myristoylation sites. 

amino acids 3-8, 242-247, 335-340, 380-385, 425-430 

Intercellular adhesion molecule (ICAM) homology, 
amino acids 17-123 
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FIGURE 28 

MLPRLLLLICAPLCEPAELFLIASPSHPTEGSPVTLTCKMPFLQSSDAQFQFCFFRDTRALGPGWSSSPKLQIAAMW 

KEDTGSYWCEAQTMASKVLRSRRSQINVHIPVSRPILMLRAPRAQAAVEDVLELHCEALRGSPPILYWFYHEDITLG 
SRSAPSGGGASFNLSLTEEHSGNYSCEANNGLGAQRSEAVTLNFTVPTGARSNHLTSGVIEGLLSTLGPATVALLFC 
YGLKRKIGRRSARDPLRSLPALPQEFTYLNSPTPGQLQPIYENVNVVSGDEVYSLAYYNQPEQESVAAETLGTHMED 
KVSLDIYSRLRKANITDVDYEDAM 

Signal sequence, 
amino acids 1-16 

Transmembrane domain, 
amino acids 211-231 

N-glycosylation sites. 

amino acids 167-170, 177-180, 197-200, 322-325 

Glycosaminoglycan attachment site, 
amino acids 160-163 

N-myristoylat ion sites. 

amino acids 31-36, 82-87, 161-166, 163-168, 176-181, 187-192, 203-208, 212-217, 
216-221 

Amidation site, 
-amino acids 238-241 

Immunoglobulin domains, 
amino acids 31-88, 126-182 
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FIGURE 29 

MLLWASLLAFAPVCGQSAAAHKPVISVHPPWTTFFKGERVTLTCNGFQFYATEKTTWYHRHYWGEKLTLTPGNTLEV 
RESGLYRCQARGSPRSNPVRLLFSSDSLILQAPYSVFEGDTLVLRCHRRRKEKLTAVKYTWNGNILSISNKSWDLLI 
PQASSNNNGNYRCIGYGDENDVFRSNFKIIKIQELFPHPELKATDSQPTEGNSVNLSCETQLPPERSDTPLHFNFFR 
DGEVILSDWSTYPELQLPTVWRENSGSYWCGAETVRGNIHKHSPSLQIHVQRIPVSGVLLETQPSGGQAVEGEMLVL 
VCSVAEGTGDTTFSWHREDMQESLGRKTQRSLRAELELPAIRQSHAGGYYCTADNSYGPVQSMVLNVTVRETPGNRD 
GLVAAGATGGLLSALLLAVALLFHCWRRRKSGVGFLGDETRLPPAPGPGESSHSICPAQVELQSLYVDVHPKKGDLV 
YSEIQTTQLGEEEEANTSRTLLEDKDVSWYSEVKTQHPDNSAGKISSKDEES 

Signal sequence, 
amino acids 1-15 

Transmembrane domain, 
amino acids 387-407 

N-glycosylation sites. 

amino acids 147-150, 209-212, 374-377, 478-481 

Glycosaminoglycan attachment site, 
amino acids 416-419 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 413-416 

Tyrosine kinase phosphorylation site. 
amino acids 457-463 

N-myristoylation sites. 

amino acids 15-20, 81-86, 89-94, 140-145, 163-168, 205-210, 257-262, 315-320, 
355-360, 382-387, 386-391, 391-396, 394-399, 395-400 

Amidation site, 
amino acids 332-335 

Immunoglobulin domains . 

amino acids 37-87, 116-169, 205-263, 303-361 
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FIGURE 30 

MTVIRFFPAASATKRVLPPVLRVSSPRTWNPNVPESPRIPAPRLPKRMSGAPTAGAALMLCAATAVLLSAQGGPVQS 
KSPRFASWDEMNVLAHGLLQLGQGLREHAERTRSQLSALERRLSACGSACQGTEGSTDLPLAPESRVDPEVLHSLQT 
QLKAQNSRIQQLFHKVAQQQRHLEKQHLRIQHLQSQFGLLDHKHLDHEVAKPARRKRLPEMAQPVDPAHNVSRLHRL 
PRDCQELFQVGERQSGLFEIQPQGSPPFLVNCKMTSDGGWTVIQRRHDGSVDFNRPWEAYKAGFGDPHGEFWLGLEK 
VHSITGDRNSRLAVQLRDWDGNAELLQFSVHLGGEDTAYSLQLTAPVAGQLGATTVPPSGLSVPFSTWDQDHDLRRD 
KNCAKSLSGGWWFGTCSHSNLNGQYFRSIPQQRQKLKKGIFWKTWRGRYYPLQATTMLIQPMAAEAAS 

Transmembrane domain, 
amino acids 4 9-69 

N-glycosylation site, 
amino acids 224-227 

cAMP- and cGMP-dependent protein kinase phosphorylation sites, 
amino acids 46-49, 118-121 

N-myristoylation sites. 

amino acids 50-55, 129-134, 341-346, 357-362 

Fibrinogen beta and gamma chains C-terminal domain signature, 
amino acids 396-408 

Fibrinogen beta and gamma chains, 
amino acids 231-447 
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FIGURE 31 

MLLWVILLVLAPVSGQFARTPRPIIFLQPPWTTVFQGERVTLTCKGFRFYSPQKTKWYHRYLGKEILRETPDNILEV 
QESGEYRCQAQGSPLSSPVHLDFSSEMGFPHAAQANVELLGSSDLLT 

Signal sequence, 
amino acids 1-15 

N-myristoylation site, 
amino acids 89-94 
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FIGURE 32 

MAGPRPSPWARLLLAALISVSLSGTLANRCKKAPVKSCTECVRVDKDCAYCTDEMFRDRRCNTQAELLAAGCQRESI 
VVMESSFQITEETQIDTTLRRSQMSPQGLRVRLRPGEERHFELEVFEPLESPVDLYILMDFSNSMSDDLDNLKKMGQ 
NLARVLSQLTSDYTIGFGKFVDKVSVPQTDMRPEKLKEPWPNSDPPFSFKNVISLTEDVDEFRNKLQGERISGNLDA 
PEGGFDAILQTAVCTRDIGWRPDSTHLLVFSTESAFHYEADGANVLAGIMSRNDERCHLDTTGTYTQYRTQDYPSVP 
TLVRLLAKHNIIPIFAVTNYSYSYYEKLHTYFPVSSLGVLQEDSSNIVELLEEAFNRIRSNLDIRALDSPRGLRTEV 
TSKMFQKTRTGSFHIRRGEVGIYQVQLRALEHVDGTHVCQLPEDQKGNIHLKPSFSDGLKMDAGIICDVCTCELQKE 
VRSARCSFNGDFVCGQCVCSEGWSGQTCNCSTGSLSDIQPCLREGEDKPCSGRGECQCGHCVCYGEGRYEGQFCEYD 
NFQCPRTSGFLCNDRGRCSMGQCVCEPGWTGPSCDCPLSNATCIDSNGGICNGRGHCECGRCHCHQQSLYTDTICEI 
NYSAIHPGLCEDLRSCVQCQAWGTGEKKGRTCEECNFKVKMVDELKRAEEVVVRCSFRDEDDDCTYSYTMEGDGAPG 
PNSTVLVHKKKDCPPGSFWWLIPLLLLLLPLLALLLLLCWKYCACCKACLALLPCCNRGHMVGFKEDHYMLRENLMA 
SDHLDTPMLRSGNLKGRDWRWKVTNNMQRPGFATHAASINPTELVPYGLSLRLARLCTENLLKPDTRECAQLRQEV 
EENLNEVYRQISGVHKLQQTKFRQQPNAGKKQDHTIVDTVLMAPRSAKPALLKLTEKQVEQRAFHDLKVAPGYYTLT 
ADQDARGMVEFQEGVELVDVRVPLFIRPEDDDEKQLLVEAIDVPAGTATLGRRLVNITIIKEQARDVVSFEQPEFSV 
SRGDQVARIPVIRRVLDGGKSQVSYRTQDGTAQGNRDYIPVEGELLFQPGEAWKELQVKLLELQEVDSLLRGRQVRR 
FHVQLSNPKFGAHLGQPHSTTIIIRDPDELDRSFTSQMLSSQPPPHGDLGAPQNPNAKAAGSRKIHFNWLPPSGKPM 
GYRVKYWIQGDSESEAHLLDSKVPSVELTNLYPYCDYEMKVCAYGAQGEGPYSSLVSCRTHQEVPSEPGRLAFNVVS 
STVTQLSWAEPAETNGEITAYEVCYGLVNDDNRPIGPMKKVLVDNPFCNRMLLIENLRESQPYRYTVKARNGAGWGPE 
REAIINLATQPKRPMSIPIIPDIPIVDAQSGEDYDSFLMYSDDVLRSPSGSQRPSVSDDTEHLVNGRMDFAFPGSTN 
SLHRMTTTSAAAYGTHLSPHVPHRVLSTSSTLTRDYNSLTRSEHSHSTTLPRDYSTLTSVSSHDSRLTAGVPDTPTR 
LVFSALGPTSLRVSWQEPRCERPLQGYSVEYQLLNGGELHRLNIPNPAQTSVVVEDLLPNHSYVFRVRAQSQEGWGR 
EREGVITIESQVHPQSPLCPLPGSAFTLSTPSAPGPLVFTALSPDSLQLSWERPRRPNGDIVGYLVTCEMAQGGGPA 
TAFRVDGDSPESRLTVPGLSENVPYKFKVQARTTEGFGPEREGIITIESQDGGPFPQLGSRAGLFQHPLQSEYSSIT 
TTHTSATEPFLVDGPTLGAQHLEAGGSLTRHVTQEFVSRTLTTSGTLSTHMDQQFFQT 

Signal sequence, 
amino acids 1-23 
Transmembrane domain, 
amino acids 711-731 
N-glycosylation sites. 

amino acids 327-330, 491-494, 579-582, 617-620, 695-698, 980-983, 1523-1526 

Glycosaminoglycan attachment site, 
amino acids 513-516 

Tyrosine kinase phosphorylation sites. 

amino acids 43-50, 401-408, 674-682, 1596-1604 

N-myristoylation sites. 

amino acids 24-29, 153-158, 234-239, 420-425, 477-482, 487-492, 560-565, 

587-592, 588-593, 782-787, 1031-1036, 1089-1094, 1093-1098, 1383-1388, 

1563-1568, 1613-1618, 1614-1619, 1635-1640, 1676-1681, 1719-1724, 1739-1744 

Amidation sites. 

amino acids 875-878, 974-977 

Cell attachment sequence. 

amino acids 1003-1006 

EGF-like domain cysteine pattern signatures, 
amino acids 479-490, 562-573 

Integrins beta chain cysteine-rich domain signature, 
amino acids 512-525, 590-603 
Integrin, beta chain homology. 

amino acids 37-455 
Calx-beta domain, 
amino acids 979-1084 
Fibronectin type III domain. 

amino acids 1127-1208, 1220-1310, 1458-1542, 1571-1658 
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FIGURE 33 

MTLDRPGEGATMLKTFTVLLFCIRMSLGMTSIVMDPQPELWIESNYPQAPWENITLWCRSPSRISSKFLLLKDKTQM 
TWIRPSHKTFQVSFLIGALTESNAGLYRCCYWKETGWSKPSKVLELEAPGQLPKPIFWIQAETPALPGCNVNILCHG 
WLQDLVFMLFKEGYAEPVDYQVPTGTiyiAIFSIDNLTPEDEGVYICRTHIQMLPTLWSEPSNPLKLVVAGLYPKPTLT 
AHPGPIMAPGESLNLRCQGPIYGMTFALMRVEDLEKSFYHKKTIKNEANFFFQSLKIQDTGHYLCFYYDASYRGSLL 
SDVLKIWVTDTFPKTWLLARPSAVVQMGQNVSLRCRGPVDGVGLALYKKGEDKPLQFLDATSIDDNTSFFLNNVTYS 
DTGIYSCHYLLTWKTSIRMPSHNTVELMVVDKPPKPSLSAWPSTVFKLGKAITLQCRVSHPVLEFSLEWEERETFQR 
FSVNGDFIISNVDGKGTGTYSCSYRVETHPNMWSHRSEPLKLMGPAGYLTWNYVLNEAIRLSLIMQLVALLLVVLWI 
RWKCRRLRIREAWLLGTAQGVTMLFIVTALLCCGLCNGVLIEETEIVMPTPKPELWAETNFPLAPWKNLTLWCRSPS 
GSTKEFVLLKDGTGWIATRPASEQVRAAFPLGALTQSHTGSYHCHSWEEMAVSEPSEALELVGTDILPKPVISASPT 
IRGQELQLRCKGWLAGMGFALYKEGEQEPVQQLGAVGREAFFTIQRMEDKDEGNYSCRTHTEKLPFKWSEPSEPLEL 
VIKEMYPKPFFKTWASPWTPGARVTFNCSTPHQHMSFILYKDGSEIASSDRSWASPGASAAHFLIISVGIGDGGNY 
SCRYYDFSIWSEPSDPVELVVTEFYPKPTLLAQPGPVVFPGKSVILRCQGTFQGMRFALLQEGAHVPLQFRSVSGNS 
ADFLLHTVGAEDSGNYSCIYYETTMSNRGSYLSMPLMIWVTDTFPKPWLFAEPSSVVPMGQNVTLWCRGPVHGVGYI 
LHKEGEATSMQLWGSTSNDGAFPITNISGTSMGRYSCCYHPDWTSSIKIQPSNTLELLVTGLLPKPSLLAQPGPMVA 
PGENMTLQCQGELPDSTFVLLKEGAQEPLEQQRPSGYRADFWMPAVRGEDSGIYSCVYYLDSTPFAASNHSDSLEIW 
VTDKPPKPSLSAWPSTMFKLGKDITLQCRGPLPGVEFVLEHDGEEAPQQFSEDGDFVINNVEGKGIGNYSCSYRLQA 
YPDIWSEPSDPLELVGAAGPVAQECTVGNIVRSSLIWVWALGVVLAIEWKKWPRLRTRGSETDGRDQTIALEECN 
QEGEPGTPANSPSSTSQRISVELPVPI 

Signal sequence, 
amino acids 1-28 
Transmembrane domains . 

amino acids 517-537, 555-575, 1261-1281 
N-glycosylation sites. 

amino acids 53-56, 338-341, 374-377, 381-384, 607-610, 747-750, 798-801, 
8'46-849, 939-942, 986-989, 1027-1030, 1082-1085, 1147-1150, 1223-1226 
Tyrosine kinase phosphorylation sites, 
amino acids 287-295, 1125-1132 
N-myristoylat ion sites. 

amino acids 102-107, 145-150, 195-200, 254-259, 305-310, 336-341, 349-354, 
388-393, 480-485, 555-560, 573-578, 656-661, 709-714, 746-751, 792-797, 814-819, 
828-833, 840-845, 844-849, 845-850, 897-902, 901-906, 933-938, 938-943, 953-958, 
984-989, 1015-1020, 1030-1035, 1130-1135, 1222-1227, 1276-1281, 1315-1320 
Immunoglobulin domains. 

amino acids 51-108, 145-201, 241-298, 336-394, 434-486, 605-662, 696-752, 
792-851, 888-944, 984-1040, 1080-1136, 1176-1228 
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FIGURE 34 

MAPEPAPGRTMVPLVPALVMLGLVAGAHGDSKPVFIKVPEDQTGLSGGVASFVCQATGEPKPRITWMKKGKKVSSQR 
FEVIEFDDGAGSVLRIQPLRVQRDEAIYECTATNSLGEINTSAKLSVLEEEQLPPGFPSIDMGPQLKVVEKARTATM 
LCAAGGNPDPEISWFKDFLPVDPATSNGRIKQLRSGALQIESSEESDQGKYECVATNSAGTRYSAPANLYVRVRRVA 
PRFSIPPSSQEVMPGGSVNLTCVAVGAPMPYVKWMMGAEELTKEDEMPVGRNVLELSNVVRSANYTCVAISSLGMIE 
ATAQVTVKALPKPPIDLWTETTATSVTLTWDSGNSEPVTYYGIQYRAAGTEGPFQEVDGVATTRYSIGGLSPFSEY 
AFRVLAVNSIGRGPPSEAVRARTGEQAPSSPPRRVQARMLSASTMLVQWEPPEEPNGLVRGYRVYYTPDSRRPPNAW 
HKHNTDAGLLTTVGSLLPGITYSLRVLAFTAVGDGPPSPTIQVKTQQGVPAQPADFQAEVESDTRIQLSWLLPPQER 
IIMYELVYWAAEDEDQQHECVTFDPTSSYTLEDLKPDTLYRFQLAARSDMGVGVFTPTIEARTAQSTPSAPPQKVMCV 
SMGSTTVRVSWVPPPADSRNGVITQYSVAHEAVDGEDRGRHVVDGISREHSSWDLVGLEKWTEYRVWVRAHTDVGPG 
PESSPVLVRTDEDVPSGPPRKVEVEPLNSTAVHVYWKLPVPSKQHGQIRGYQVTYVRLENGEPRGLPIIQDVMLAEA 
QWRPEESEDYETTISGLTPETTYSVTVAAYTTKGDGARSKPKIVTTTGAVPGRPTMMISTTAMNTALLQWHPPKELP 
GELLGYRLQYCRADEARPNTIDFGKDDQHFTVTGLHKGTTYIFRLAAKNRAGLGEEFEKEIRTPEDLPSGFPQNLHV 
TGLTTSTTELAWDPPVLAERNGRIISYTVVFRDINSQQELQNITTDTRFTLTGLKPDTTYDIKVRAWTSKGSGPLSP 
SIQSRTMPVEQVFABCNFRVAAAMKTSVLLSWEVPDSYKSAVPFKILYNGQSVEVDGHSMRKLIADLQPNTEYSFVLM 
NRGSSAGGLQHLVSIRTAPDLLPHKPLPASAYIEDGRFDLSMPHVQDPSLVRWFYIWVPIDRVGGSMLTPRWSTPE 
ELELDELLEAIEQGGEEQRRRRRQAERLKPYVAAQLDVLPETFTLGDKKNYRGFYNRPLSPDLSYQCFVLASLKEPM 
DQKRYASSPYSDEIVVQVTPAQQQEEPEMLWVTGPVLAVILIILIVIAILLFKRKRTHSPSSKDEQSIGLKDSLLAH 
SSDPVEMRRLNYQTPGMRDHPPIPITDLADNIERLKANDGLKFSQEYESIDPGQQFTWENSNLEVNKPKNRYANVIA 
YDHSRVILTSIDGVPGSDYINANYIDGYRKQNAYIATQGPLPETMGDFWRMVWEQRTATVVMMTRLEEKSRVKCDQY 
WPARGTETCGLIQVTLLDTVELATYTVRTFALHKSGSSEKRELRQFQFMAWPDHGVPEYPTPILAFLRRVKACNPLD 
AGPMWHCSAGVGRTGCFIVIDAMLERMKHEKTVDIYGHVTCMRSQRNYMVQTEDQYVFIHEALLEAATCGHTEVPA 
RNLYAHIQKLGQVPPGESVTAMELEFKLLASSKAHTSRFISANLPCNKFKNRLVNIMPYELTRVCLQPIRGVEGSDY 
INASFLDGYRQQKAYIATQGPLAESTEDFWRMLWEHNSTIIVMLTKLREMGREKCHQYWPAERSARYQYFWDPMAE 
YNMPQYILREFKVTDARDGQSRTIRQFQFTDWPEQGVPKTGEGFIDFIGQVHKTKEQFGQDGPITVHCSAGVGRTGV 
FI TLS I VLERMRYEG WDMFQT VKTLRTQRPAMVQTEDQYQLC YRAALEYLGS FDH YAT 

Signal sequence, 
amino acids 1-2 9 
Transmembrane domain, 
amino acids 1262-1282 
N-glycosylation sites. 

amino acids 117-120, 250-253, 295-298, 721-724, 966-969, 1696-1699, 1731-1734 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 71-74, 1286-1289 

Tyrosine kinase phosphorylation sites. 

amino acids 97-105, 773-780, 1229-1237, 1687-1694 

N-myristoylation sites. 

amino acids 22-27, 44-49, 47-52, 214-219, 246-251, 305-310, 368-373, 442-447, 
470-475, 481-486, 739-744, 818-823, 881-886, 926-931, 1081-1084, 1301-1306, 
1349-1354, 1362-1367, 1399-1404, 1468-1473, 1551-1556, 1688-1693, 1790-1795, 
1807-1812, 1842-1847, 1847-1852 
Amidation site, 
amino acids 69-72 
Leucine zipper pattern, 
amino acids 1262-1283 

Myb DNA-binding domain repeat signature 1. 
amino acids 1151-1159 

Tyrosine specific protein phosphatases active sites. 

amino acids 1546-1558, 1837-1849 

Immunoglobulin domains. 

amino acids 47-109, 149-209, 246-300 

Fibronectin type III domains. 

amino acids 319-401, 413-500, 512-594, 606-696, 708-809, 821-904, 915-1000 
Protein-tyrosine phosphatase homology, 
amino acids 1375-1606, 1664-1897 



40/47 



FIGURE 35 

MRRAALWLWLCALALSLQLALPQIVATNLPPEDQDGSGDDSDNFSGSGAGALQDITLSQQTPSTWKDTQLLTAIPTS 
PEPTGLEATAASTSTLPAGEGPKEGEAVVLPEVEPGLTAREQEATPRPRETTQLPTTHQASTTTATTAQEPATSHPH 
RDMQPGHHETSTPAGPSQADLHTPHTEDGGPSATERAAEDGASSQLPAAEGSGEQDFTFETSGENTAVVAVEPDRRN 
QSPVDQGATGASQGLLDRKEVLGGVIAGGLVGLIFAVCLVGFMLYRMKKKDEGSYSLEEPKQANGGAYQKPTKQEEFYA 

Signal sequence, 
amino acids 1-22 

Transmembrane domain, 
amino acids 252-272 

N-glycosylation site, 
amino acids 4 3-4 6 

Glycosaminoglycan attachment sites, 
amino acids 45-48, 47-50 

Tyrosine kinase phosphorylation site, 
amino acids 27 9-28 6 

N-myristoylation sites. 

amino acids 46-51, 82-87, 183-188, 238-243, 241-246, 254-259, 255-260, 259-264, 
263-268 

Syndecans signature, 
amino acids 276-289 



Syndecan domain . 
amino acids 3-308 
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FIGURE 36 

MKIFLPVLLAALLGVERASSLMCFSCLNQKSNLYCLKPTICSDQDNYCVTVSASAGIGNLVTFGHSLSKTCSPACPI 
PEGVNVGVASMGISCCQSFLCNFSAADGGLRASVTLLGAGLLLSLLPALLRFGP 

Signal sequence, 
amino acids 1-20 

Transmembrane domain, 
amino acids 108-128 

N-glycosylation site, 
amino acids 99-102 

N-myristoylation sites. 

amino acids 14-19, 58-63, 80-85, 89-94, 105-110, 106-111, 117-122 

u-PAR/Ly-6 domain, 
amino acids 21-100 
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FIGURE 37 

MDGKKCSVWMFLPLVFTLFTSAGLWIVYFIAVEDDKILPLNSAERKPGVKHAPYISIAGDDPPASCVFSQVMNMAAF 

LALVVAVLRFIQLKPKVLNPWLNISGLVALCLASFGMTLLGNFQLTNDEEIHNVGTSLTFGFGTLTCWIQAALTLKV 
NIKNEGRRVGIPRVILSASITLCVVLYFILMAQSIHMYAARVQWGLVMCFLSYFGTFAVEFRHYRYEIVCSEYQENF 
LSFSESLSEASEYQTDQV 

Transmembrane domains . 

amino acids 7-27, 65-85, 97-117, 131-151, 166-186 

N-glycosylation site, 
amino acids 100-103 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 4-7 

N-myristoylation sites. 

amino acids 132-137, 140-145, 199-201 

Amidation sites, 

amino acids 2-5, 159-162 
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FIGURE 38 

MELALLCGLVVl^GVIPIQGGILNLNKMVKQVTGBCMPILSYWPyGCHCGLGGRGQPKDATDWCCQTHDCCT 
GCGIYKDYYRYNFSQGNIHCSDKGSWCEQQLCACDKEVAFCLKRNLDTYQKRLRFYWRPHCRGQTPGC 

Signal sequence, 
amino acids 1-17 

N-glycosylation site, 
amino acids 8 9-92 

N-myristoylation sites. 

amino acids 20-25, 45-50, 93-98, 140-145 

Phospholipase A2 histidine active site, 
amino acids 63-70 

Phospholipase A2 aspartic acid active site, 
amino acids 108-118 

Phospholipase A2 homology, 
amino acids 21-145 
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FIGURE 39 



MYIRVSYDTKPDSLLHLMVKDWQLELPKLLISVHGGLQNFEMQPKLKQVFGKGLIKAAMTTGAWIFTGGVSTGVISH 
VGDALKDHSSKSRGRVCAIGIAPWGIVENKEDLVGKDVTRVYQTMSNPLSKLSVLNNSHTHFILADNGTLGKYGAEV 
KLRRLLEKHISLQKINTRLGQGVPLVGLVVEGGPNVVSIVLEYLQEEPPIPVVICDGSGRASDILSFAHKYCEEGGI 
INESLREQLLVTIQKTFNYNKAQSHQLFAIIMECMKKKELVTVFRMGSEGQQDIEMAILTALLKGTNVSAPDQLSLA 
LAWNRVDIARSQIFVFGPHWTPLGSLAPPTDSKATEKEKKPPMATTKGGRGKGKGKKKGKVKEEVEEETDPRKIELL 
NWVNALEQAMLDALVLDRVDFVKLLIENGVNMQHFLTIPRLEELYNTRLGPPNTLHLLVRDVKKSNLPPDYHISLID 
IGLVLEYLMGGAYRCNYTRKNFRTLYNNLFGPKRPKALKLLGMEDDEPPAKGKKKKKKKKEEEIDIDVDDPAVSRFQ 
YPFHELMVWAVLMKRQKMAVFLWQRGEESMAECALVACKLYKAMAHESSESDLVDDISQDLDNNSKDFGQLALELLDQ 
SYKHDEQIAMKLLTYELKNWSNSTCLKLAVAAKHRDFIAHTCSQMLLTDMWMGRLRMRKNPGLKVIMGILLPPTILF 
LEFRTYDDFSYQTSKENEDGKEKEEENTDANADAGSRKGDEENEHKKQRSIPIGTKICEFYNAPIVKFWFYTISYLG 
YLLLFNYVILVRMDGWPSLQEWIVISYIVSLALEKIREILMSEPGKLSQKIKVWLQEYWNITDLVAISTFMIGAILR 
LQNQPYMGYGRVIYCVDIIFWYIRVLDIFGVNKYLGPYVMMIGKMMIDMLYFWIMLVVLMSFGVARQAILHPEEKP 
SWKLARNIFYMPYWMIYGEVFADQIDLYAMEINPPCGENLYDEEGKRLPPCIPGAWLTPALMACYLLVANILLVNLL 
lAVFNNTFFEVKSISNQVWKFQRYQLIMTFHDRPVLPPPMIILSHIYIIIMRLSGRCRKKREGDQEERDRGLKLFLS 
DEELKRLHEFEEQCVQEHFREKEDEQQSSSDERIRVTSERVENMSMRLEEINERETFMKTSLQTVDLRLAQLEELSN 
RMVNALENLAGIDRSDLIQARSRASSECEATYLLRQSSINSADGYSLYRYHFNGEELLFEDTSLSTSPGTGVRKKTC 
SFRIKEEKDVKTHLVPECQNSLHLSLGTSTSATPDGSHLAVDDLKNAEESKLGPDIGISKEDDERQTDSKKEETISP 
SLNKTDVIHGQDKSDVQNTQLTVETTNIEGTISYPLEETKITRYFPDETINACKTMKSRSFVYSRGRKLVGGVNQDV 
EYSSITDQQLTTEWQCQVQKITRSHSTDIPYIVSEAAVQAEQKEQFADMQDEHHVAEAIPRIPRLSLTITDRNGMEN 
LLSVKPDQTLGFPSLRSKSLHGHPRNVKSIQGKLDRSGHASSVSSLVIVSGMTAEEKBCVKKEKASTETEC 



Transmembrane domains. 

amino acids 759-779, 828-848, 857-877, 893-913, 976-996, 992-1012, 1031-1051 
N-glycosylation sites. 

amino acids 133-136, 144-147, 233-236, 298-301, 478-481, 601-604, 635-638, 

638-641, 830-833, 1006-1009, 1121-1124, 1312-1315 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 1228-1231 

Tyrosine kinase phosphorylation sites, 
amino acids 697-704, 891-898 
N-myristoylation sites. 

amino acids 35-40, 53-58, 68-73, 69-74, 102-107, 211-216, 229-234, 296-301, 
473-478, 728-733, 747-752, 1166-1171, 1259-1264, 1268-1273, 1319-1324 
Amidation sites. 

amino acids 362-365, 513-516, 968-971, 1374-1377 
Ion transport protein homology, 
amino acids 789-1005 
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FIGURE 40 

MSTEKVDQKEEAGEKEVCGDQIKGPDKEEEPP7VAASHGQGWRPGGRAARNARPEPGARHPALPAMVNDPPVPALLWA 

QEVGQVLAGRARRLLLQFGVLFCTILLLLWVSVFLYGSFYYSYMPTVSHLSPVHFYYRTDCDSSTTSLCSFPVANVS 
LTKGGRDRVLMYGQPYRVTLELELPESPVNQDLGMFLVTISCYTRGGRIISTSSRSVMLHYRSDLLQMLDTLVFSSL 
LLFGFAEQKQLLEVELYADYRENSYVPTTGAIIEIHSKRIQLYGAYLRIHAHFTGLRYLLYNFPMTCAFIGVASNFT 
FLSVIVLFSYMQWVWGGIWPRHRFSLQVNIRKRDNSRKEVQRRISAHQPGPEGQEESTPQSDVTEDGESPEDPSGTE 
GQLSEEEKPDQQPLSGEEELEPEASDGSGSWEDAALLTEANLPAPAPASASAPVLETLGSSEPAGGALRQRPTCSSS 

Transmembrane domains , 

amino acids 94-114, 134-154, 292-312, 308-328 

N-glycosylation sites, 
amino acids 152-155, 306-309 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 350-353 

Tyrosine kinase phosphorylation site, 
amino acids 240-248 

N-myristoylation sites. 

amino acids 44-49, 81-86, 96-101, 302-307, 361-366 

Leucine zipper patterns, 
amino acids 84-105, 91-112 
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FIGURE 41 

MGPPLPLLLLLLLLLPPRVLPAAPSSVPRGRQLPGRLGCLLEEGLCGASEACVNDGVFGRCQECVPAMDFYRYEVSPV 
ALQRLRVALQKLSGTGFTWQDDYTQYVMDQELADLPKTYLRRPEASSPARPSKHSVGSERRYSREGGAALANALRRH 
LPFLEALSQAPASDVLARTHTAQDRPPAEGDDRFSESILTYVAHTSALTYPPGPRTQLREDLLPRTLGQLQPDELSP 
KVDSGVDRHHLMAALSAYAAQRPPAPPGEGSLEPQYLLRAPSRMPRPLLAPAAPQKWPSPLGDSEDPSSTGDGARIH 
TLLKDLQRQPAEVRGLSGLELDGMAELMAGLMQGVDHGVARGSPGRAALGESGEQADGPKATLRGDSFPDDGVQDDD 
DRLYQEVHRLSATLGGLLQDHGSRLLPGALPFARPLDMERKKSEHPESSLSSEEETAGVENVKSQTYSKDLLGQQPH 
SEPGAAAFGELQNQMPGPSKEEQSLPAGAQEALSDGLQLEVQPSEEEARGYIVTDRDPLRPEEGRRLVEDVARLLQV 
PSSAFADVEVLGPAVTFPCVSANVQNVTTEDVEKATVDNKDKLEETSGLKILQTGVGSKSKLKFLPPQAEQEDSTKFI 
ALTLVSLACILGVLLASGLIYCLRHSSQHRLKEKLSGLGGDPGADATAAYQELCRQRMATRPPDRPEGPHTSRISSV 
SSQFSDGPIPSPSARSSASSWSEEPVQSNMDISTGHMILSYMEDHLKNKNRLEKEWEALCAYQAEPNSSFVAQREEN 
VPKNRSLAVLTYDHSRVLLKAENSHSHSDYINASPIMDHDPRNPAYIATQGPLPATVADFWQMVWESGCWIVMLTP 
LAENGVRQCYHYWPDEGSNLYHIYEVNLVSEHIWCEDFLVRSFYLKNLQTNETRTVTQFHFLSWYDRGVPSSSRSLL 
DFRRKVNKCYRGRSCPIIVHCSDGAGRSGTYVLIDMVLNKMAKGAKEIDIAATLEHLRDQRPGMVQTKEQFEFALTA 
VAEE VN A I LKAL PQ 

Signal sequence, 
amino acids 1-21 
Transmembrane domain, 
amino acids 616-636 
N-glycosylation sites. 

amino acids 564-567, 760-763, 774-777, 898-901 
Glycosaminoglycan attachment sites, 
amino acids 90-93, 652-655 

cAMP- and cGMP-dependent protein kinase phosphorylation sites, 
amino acids 137-140, 425-428 
N-myristoylation sites. 

amino acids 44-49, 47-52, 91-96, 144-149, 338-343, 342-347, 346-351, 490-495, 

628-633, 634-639, 655-660, 659-664, 852-857, 915-920, 948-953, 987-992 

Amidation site. 

amino acids 525-528 

Cell attachment sequence. 

amino acids 372-374 

Tyrosine specific protein phosphatases active site, 
amino acids 94 3-955 
Protein-tyrosine phosphatase, 
amino acids 770-1004 
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FIGURE 42 

MQPPPSLCGRALVALVLACGLSRIWGEERGFPPDRATPLLQTAEIMTPPTKTLWPKGSNASLARSLAPAEVPKGDRT 
AGSPPRTISPPPCQGPIEIKETFKYINTVVSCLVFVLGIIGNSTLLRIIYKNKCMRNGPNILIASLALGDLLHIVID 
IPINVYKLLAEDWPFGAEMCKLVPFIQKASVGITVLSLCALSIDRYRAVASWSRIKGIGVPKWTAVEIVLIWVVSVV 
LAVPEAIGFDIITMDYKGSYLRICLLHPVQKTAFMQFYKTAKDWWLFSFYFCLPLAITAFFYTLMTCEMLRKKSGMQ 
lALNDHLKQRREVAKTVFCLVLVFALCWLPLHLSRILKLTLYNQNDPNRCELLSFLLVLDYIGINMASLNSCINPIA 
LYLVSKRFKNCFKSCLCCWCQSFEEKQSLEEKQSCLKFKANDHGYDNFRSSNKYSSS 

Transmembrane domains . 

amino acids 1-19, 101-121, 137-157, 177-197, 216-236, 275-295, 323-343, 362-382 

N-glycosylation sites, 
amino acids 59-62, 119-122 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 302-305 

Tyrosine kinase phosphorylation site, 
amino acids 424-430 

N-myristoylation sites. 

amino acids 57-62, 115-120, 170-175, 306-311, 371-376 

7 transmembrane receptor homology, 
amino acids 118-386 



